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A Fundamental Experiment in Quantum Physics 

The Wave-Particle Dualism of Matter 





When helium atoms all having the same direction and velocity are passed through a double-slit apparatus, 
each atom produces a strictly localised point of impact on a screen behind the slits; the atoms appear to 
be essentially particles. If the experiment is allowed to run for a longer time, so that a large number of im- 
pact points is registered on the screen, then an interference pattern appears, analogous to that seen in 
Young’s double-slit experiment with light; the helium atoms thus behave in this case as waves. The seven 
images show the measured intensity distribution on the screen as a function of time (5' to 42hl8') after 
starting the experiment. This experiment demonstrates the wave-particle dualism of matter in an impres- 
sive fashion. How quantum theory bridges the apparent contradiction: pointlike particle on the one hand, 
extended wave on the other, is a subject treated in this book. These experiments on helium atoms were 
carried out by O. Carnal, J. Mlynek: Phys. Rev. Lett. 66, 2689 (1991) and Ch. Kurtsiefer, T. Pfau, J. Mly- 
nek: Nature 386, 150 (1997). More details are given in Sect. 6.6. 





Preface to the Sixth Edition 



Since a new edition of our book has once again become necessary, we have as be- 
fore taken the opportunity to include the latest developments in atomic and quantum 
physics. These areas continue to yield new and fascinating experimental and theoret- 
ical results which are of fundamental importance and are also extremely interesting 
to students of science. As a result of newly developed experimental methods and 
theoretical techniques, it has also become possible to find solutions to some long-es- 
tablished problems. In this spirit we have added an entire new chapter dealing with 
entangled wavefunctions, the Einstein-Podolsky-Rosen paradox. Bell’s inequalities, 
the paradox of Schrodinger’s cat and the concept of decoherence. In addition, we 
have treated new ideas relating to quantum computers and the numerous quantum- 
physical schemes for constructing them. These new concepts exemplify the rapidly- 
developing area of quantum information. 

Finally, in this new chapter we have included the experimental realisation of the 
Bose-Einstein condensation and of the atom laser, which promise important new ap- 
plications. 

In Chap. 22, “Modem methods of optical spectroscopy”, we have added a new 
section on nondestmctive photon detection as an example of efficient methods for 
investigating the interactions between atoms and photons in resonant cavities. Con- 
sidering the current importance of these areas, we emphasize references to the origi- 
nal literature. These can be found in the Bibliography. 

In treating all of these subjects, we have as usual made an effort to give a read- 
ily understandable description, in line with the tradition of this book. 

Once again, we express our gratitude to those students, colleagues and other 
readers of the book who have made a number of suggestions for its improvement. 
Our special thanks go to our colleagues Th. Hansch, J. Mlynek and T. Pfau for pro- 
viding us with coloured figures of their newest experimental results. We thank Ms. 
Irmgard Moller for her quick and careful preparation of the new parts of the manu- 
script. We are grateful to Springer- Verlag, in particular Dr. H.J. Kolsch and Mr. 
C.-D. Bachem for their efficient cooperation as always, and Prof. W. D. Brewer for 
his excellent translation of the new chapters. 



Stuttgart, March 2000 



H, Haken H. C. Wolf 




Preface to the Fourth Edition 



This fourth edition contains a few additional figures. Otherwise only typographical er- 
rors have been removed. 

The final chapter on Fundamentals of the Quantum Theory of Chemical Bonding 
is continued in an extended way in the textbook Molecular Physics and Elements of 
Quantum Chemistry by the same authors. This book contains, in particular, a profound 
presentation of group theory as applied to atoms and molecules. Furthermore, the in- 
teraction between atoms and molecules and light is treated in detail. 

We thank again Springer-Verlag, in particular Dr. H.J. Kdlsch and Mr. C.-D. 
Bachem for their excellent cooperation as always, and Prof. W. D. Brewer for his con- 
tinuous support in translating our German text. 



Stuttgart, February 1994 



H. Haken H. C. Wolf 




Preface to the First Edition 



A thorough knowledge of the physics of atoms and quanta is clearly a must for every 
student of physics but also for students of neighbouring disciplines such as chemistry 
and electrical engineering. What these students especially need is a coherent presenta- 
tion of both the experimental and the theoretical aspects of atomic and quantum phys- 
ics. Indeed, this field could evolve only through the intimate interaction between in- 
genious experiments and an equally ingenious development of bold new ideas. 

It is well known that the study of the microworld of atoms caused a revolution of 
physical thought, and that fundamental ideas of classical physics, such as those on 
measurability, had to be abandoned. But atomic and quantum physics is not only a 
fascinating field with respect to the development of far-reaching new physical ideas. 
It is also of enormous importance as a basis for other fields. For instance, it provides 
chemistry with a conceptual basis through the quantum theory of chemical bonding. 
Modern solid-state physics, with its numerous applications in communication and com- 
puter technology, rests on the fundamental concepts first developed in atomic and 
quantum physics. Among the many other important technical applications we mention 
just the laser, a now widely used light source which produces light whose physical 
nature is quite different from that of conventional lamps. 

In this book we have tried to convey to the reader some of the fascination which 
atomic and quantum physics still gives a physicist studying this field. We have tried 
to elaborate on the fundamental facts and basic theoretical methods, leaving aside all 
superfluous material. The text emerged from lectures which the authors, an experimen- 
talist and a theoretician, have given at the University of Stuttgart for many years. These 
lectures were matched with respect to their experimental and theoretical contents. 

We have occasionally included in the text some more difficult theoretical sections, 
in order to give a student who wants to penetrate further into this field a self-contained 
presentation. The chapters which are more difficult to read are marked by an asterisk. 
They can be skipped on a first reading of this book. We have included chapters impor- 
tant for chemistry, such as the chapter on the quantum theory of the chemical bond, 
which may also serve as a starting point for studying solid-state physics. We have fur- 
ther included chapters on spin resonance. Though we explicitly deal with electron spins, 
similar ideas apply to nuclear spins. The methods of spin resonance play a fundamental 
role in modern physical, chemical and biological investigations as well as in medical 
diagnostics (nuclear spin tomography). Recent developments in atomic physics, such 
as studies on Rydberg atoms, are taken into account, and we elaborate the basic 
features of laser light and nonlinear spectroscopy. We hope that readers will find 
atomic and quantum physics just as fascinating as did the students of our lectures. 

The present text is a translation of the second German edition Atom- and Quanten- 
physik. We wish to thank Prof. W. D. Brewer for the excellent translation and the most 
valuable suggestions he made for the improvement of the book. Our thanks also go 
to Dr. J. V. Schiitz and Mr. K. Zeile for the critical reading of the manuscript, to Ms. 
S. Schmiech and Dr. H. Ohno for the drawings, and to Mr. G. Haubs for the careful 
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Preface to the First Edition 



proofreading. We would like to thank Mrs. U. Funke for her precious help in typing 
new chapters. Last, but not least, we wish to thank Springer-Verlag, and in particular 
H. Lotsch and G.M. Hayes, for their excellent cooperation. 

Stuttgart, February 1984 H, Haken H.C. Wolf 
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List of the Most Important Symbols Used 



The numbers of the equations in which the symbols are defined are given in parenthe- 
ses; the numbers in square brackets refer to the section of the book. The Greek symbols 
are at the end of the list. 



A 


Vector potential 




Hamilton function. 


A 


Amplitude or constant 




Hamiltonian operator 


A 


Mass number (2.2) or area 


Hn 


Hermite polynomial 
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Interval factor or fine struc- 
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Planck’s constant 




ture constant (12.28) and 
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= h/2n 




hyperfine splitting (20.10) 


1,1 


Nuclear angular momentum 




Bohr radius of the H atom in 
its ground state (8.8) 




and corresponding quantum 
number (20.1) 


B 


Magnetic induction 
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Abbreviation for integrals 
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Creation and annihilation opera- 
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tors for the harmonic oscillator 
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Imaginary unit (i = |/^) 
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Constant, impact parameter 
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Total angular momentum of an 
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Constant 




electron shell and correspond- 
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Velocity of light, series expan- 




ing quantum number (17.5) 




sion coefficient 


j,j 


Total angular momentum of 


c.c. 


Complex conjugate 




an electron and corresponding 
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Dipole moment 




quantum number [12.7] 
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Constant 
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Operator for the total angular 


dV 


Infinitesimal volume element 




momentum 
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Electric field strength 
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Boltzmann’s constant, force 
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Wave vector 
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Kinetic energy 
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Potential energy 
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Total energy 




quantum number (17.3) 
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Proton charge 


Ln 


Laguerre polynomial (10.81) 


-e 


Electron charge 
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Orbital angular momentum of 
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Exponential function 




an electron and corresponding 
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Electric field strength (14.1) 




quantum number 


F,F 


Total angular momentum of an 
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Angular momentum operator 




atom, including nuclear angu- 
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1 Mass 




lar momentum and correspond- 
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Magnetic quantum number 




ing quantum number (20.6) 


m, 


- for angular momentum 
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Amplitude of the magnetic in- 


ms 


- for spin 




duction [14.4, 14.5] 


rrij 


Magnetic quantum number for 
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Spring constant 




total angular momentum 
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Lande g factor (12.10, 16, 21, 
13.18, 20.13) 


mo 


Rest mass, especially that of 
the electron 
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N, n Particle number, particle num- 
ber density 

N Normalisation factor 

n Principal quantum number or 

number of photons or an 
integer 

P Spectral radiation flux density 

(5.2) or probability 
P^i Legendre polynomial 

Pf (m 0) Associated Legendre 

function 

p, p Momentum, expectation value 

of momentum 

Q Nuclear quadrupole moment 

( 20 . 20 ) 

Q, q Charge 

R (r) Radial part of the hydrogen 

wavefunction 

r Position coordinate (three-di- 

mensional vector) 
r Distance 

S Resultant spin (17.4) 

S Symbol for orbital angular 

momentum L = 0 

s, s Electron spin and correspond- 

ing quantum number (12.15) 
s Spin operator = Sy, s^) 

T Absolute temperature 

Tx Longitudinal relaxation time 

Ti Transverse relaxation time 

t Time 

u Spectral energy density (5.2), 

atomic mass unit [2.2] 

V Volume, potential, electric 
voltage 

V Expectation value of the 
potential energy 

V Velocity, particle velocity 

X Particle coordinate (one- 

dimensional) 

X Expectation value of position 

0) Spherical harmonic func- 
tions (10.10, 48-50) 

Z Nuclear charge 

a Fine structure constant [8.10] 

or absorption coefficient (2.22) 
P Constant 

r Decay constant 

y Decay constant or linewidth 

gyromagnetic ratio (12.12) 



Laplace operator 

= aVax^+0V0j^^+aV0z^ 

AE Energy uncertainty 
Ak Wavenumber uncertainty 
Ap Momentum uncertainty 
A t Time uncertainty ( = finite 
measurement time) 

A V Finite volume element 
A CO Uncertainty in the angular fre- 
quency 

Ax Position uncertainty 
J(x) Dirac delta function (see mathe- 
matics appendix) 

^ Kronecker delta symbol: 

^ = 1 for = V, y = 0 for 
£ Dimensionless energy (9.83) 

Energy contributions to pertur- 
bation theory 

£o Permittivity constant of 
vacuum 

6 Angle coordinate (10.2) 

K Defined in (10.54) 

A Wavelength (exception: expan- 
sion parameter in [15.2.1, 2 ]) 
Mean free path [2.4.3] 

/I, ju Magnetic moment (12.1) 

ju Reduced mass (8.15) 

//b Bohr magneton (12.8) 

//n Nuclear magneton (20.3) 

V Frequency [8.1] 

V Wavenumber [8.1] 

Dimensionless coordinate (9.83) 

Q Charge density, density of 
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G Scattering coefficient, interac- 

tion cross section (2.16) 

T Torque (12.2) 

0 Phase 
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01 , 0 Spin wavefunctions 

if/ Wavefunction 

^ Wavefunction of several electrons 

Q Generalised quantum 

mechanical operator 
Q Frequency [14.4, 14.5, 15.3] 

CO Angular frequency 2nv, or 
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1. Introduction 



1.1 Classical Physics and Quantum Mechanics 



Atomic and quantum physics, which are introduced in this book, are essentially prod- 
ucts of the first third of this century. The division of classical physics into branches 
such as mechanics, acoustics, thermodynamics, electricity, and optics had to be 
enlarged when - as a consequence of the increasing knowledge of the structure of 
matter - atoms and quanta became the objects of physical research. Thus, in the twen- 
tieth century, classical physics has been complemented by atomic physics and the 
physics of light or energy quanta. The goal of atomic physics is an understanding of the 
structure of atoms and their interactions with one another and with electric and mag- 
netic fields. Atoms are made up of positively charged nuclei and negatively charged 
electrons. The electromagnetic forces through which these particles interact are well 
known in classical physics. 

The physics of atomic nuclei cannot be understood on the basis of these forces 
alone. A new force - the nuclear or strong force - determines the structures of nuclei, 
and the typical binding energies are orders of magnitude larger than those of the elec- 
trons in atoms. The study of nuclei, of elementary particles, and the whole of high 
energy physics thus form their own branches of physics. They will not be treated in this 
book. 



1.2 Short Historical Review 

The word atom comes from the Greek and means “the indivisible”, the smallest com- 
ponent of matter, which cannot be further divided. This concept was introduced in the 
5th and 4th centuries B.C. by Greek natural philosophers. The first atomic theories of 
the structure of matter were those of Democrites (460-370 B.C.), Plato (429-348), 
and Aristotle (384-322). It required more than two millenia until this speculative 
atomism grew into an exact atomic physics in the modern sense. 

The meaning of the word atom becomes less subject to misinterpretation if it is 
translated into Latin: an individuum is the smallest unit of a large set which possesses 
all the essential characteristics of the set. In this sense, an atom is in fact indivisible. 
One can, to be sure, split a hydrogen atom into a proton and an electron, but the 
hydrogen is destroyed in the process. For example, one can no longer observe the 
spectral lines characteristic of hydrogen in its optical spectrum. 

Atomism as understood by modern science was first discovered for matter, then for 
electricity, and finally for energy. 

The atomism of matter, the recognition of the fact that all the chemical elements are 
composed of atoms, followed from chemical investigations. The laws of constant and 
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multiple proportions, formulated by J.L Proust ca. 1799 and by Dalton ca. 1803, 
can be explained very simply by the atomic hypothesis: 

The reaction equations 

14 g nitrogen + 16 g oxygen yield 30 g NO and 
14 g nitrogen + 32 g oxygen yield 46 g NO 2 

mean: the atomic weights of nitrogen and oxygen are related as 14:16. 

Only whole atoms are react with one another. The first atomic model (W Prout, 1815) 
assumed that the atoms of all elements are put together out of hydrogen atoms. As a 
heuristic principle this hypothesis finally led to a scheme for ordering the elements 
based on their chemical properties, the periodic system of L. Meyer and D. L Mendeleev 
(1869). More about this subject may be found in introductory textbooks on chemistry. 

About the same time (1808), it was found by Gay-Lussac that not only the weights 
but also the volumes of gaseous reactants occur as ratios of small integers. In the above 
example, 

1 volume nitrogen + 1 volume oxygen yield 2 volumes NO 
1 volume nitrogen + 2 volumes oxygen yield 2 volumes NO 2 . 

Similar observations led to the hypothesis of Avogadro (1811): Equal volumes of gases 
under similar conditions (pressure, temperature) contain equal numbers of molecules. 

Continued investigations of gases in the course of the 19th century led to the 
atomism of heat, that is, to the explanation of heat in general and of the thermodynam- 
ic laws in particular as consequences of atomic motion and collisions. In about 1870, 
the first theory to encompass a whole branch of physics, the kinetic theory of gases, 
was completed by the physicists Clausius and Boltzmann. 

The atomism of electricity was discovered in 1833 by the English scientist Michael 
Faraday. Based on the quantitative evaluation of exceedingly careful measurements of 
the electrolysis of liquids, he formulated his famous laws: 

The quantity of an element which is separated is proportional to the quantity of 
charge transported in the process, 

and 

various elements are separated into equivalent weights by the same quantity of 
charge. 

From this, Faraday concluded: 

There are “atoms” of electricity - it was only after 70 years that their mass and 
charge could be determined - 

and 

these “atoms” of electricity - the electrons - are bound to atoms of matter. 

The discovery of the atomism of energy can be dated exactly: on December 14, 
1900, Planck announced the derivation of his laws for black body radiation in a lecture 
before the Physical Society in Berlin. In order to derive these laws, he assumed that the 
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energy of harmonic oscillators can only take on discrete values - quite contradictory 
to the classical view, in which the energy values form a continuum. 

This date can be called the birth date of quantum theory. The further development 
of atomic and quantum physics is the subject of this book. 

Our knowledge of the structure of atoms was influenced strongly by the investiga- 
tion of optical spectra. After Kirchhoff and Bunsen had shown, about 1860, that 
optical spectra are characteristic of the elements which are emitting or absorbing the 
light, Balmer (1885) succeeded in finding an ordering principle in atomic spectra, ex- 
pressed in the formula (8.1) which bears his name and which describes the spectral lines 
emitted from hydrogen atoms. As a result of the atomic model proposed hy Rutherford 
(1911), Bohr was able, in 1913, to formulate the basic principles of the quantisation of 
electron orbits in atoms. These quantisation rules were considerably extended by Som- 
merfeld. A parallel development was the concept of matter waves, carried out by De 
Broglie. The actual breakthrough was attained by Born, Heisenberg, Schrodinger, 
Pauli, Dirac, and other researchers in the decade between 1920 and 1930. 

The problems of atomic physics which are of current interest in research are: 

- an increasingly detailed description of the structure of electronic shells of atoms and 
their excitations, 

- the interactions between atoms and radiation fields, for example in view of their 
applications in optical pumping (Chap. 21) and in laser physics (Chap. 22), 
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- the interactions of atoms among themselves due to collisions in the gas phase and 
during the formation of molecules, 

- the physical principles which lead to the formation of condensed phases from single 
atoms, and their properties. 

Thus molecular and solid state physics are based on atomic physics, and chemistry 
as well makes constant use of its laws and principles. 

Atomic physics is furthermore a basic science for many other disciplines of re- 
search, technology, and applications. A few examples are shown in Fig. 1.1. 

The following chapters do not give a historical or a chronological presentation; they 
do, however, show the general line of developments and discoveries. An inductive ap- 
proach is often used. It is of little use in physics to content oneself with the acquisition 
of factual knowledge. A physicist must learn to analyse, to explain, and to extract the 
essentials from experimental findings. In this way, one develops models for nature. In 
the process, it is important to recognise relationships to other experimental results and 
to be able to predict the outcome of new experiments. The predictions must then be ex- 
perimentally tested. Because of this process, physics is not a dead, finalised science, but 
rather is in a constant state of development, since new experimental techniques open up 
new areas of research while, on the other hand, the process of developing physical 
concepts gives the impulse for ever newer experiments. 




2. The Mass and Size of the Atom 



2.1 What is an Atom? 



An atom is the smallest unchangeable component of a chemical element. Unchangeable 
means in this case by chemical means \ i.e., by reactions with acids or bases or the effect 
of moderate temperatures, atoms may only be slightly changed, namely, in their degree 
of ionisation. Moderate temperatures refers here to temperatures whose equivalent 
energy kT{k\^ Boltzmann’s constant, T the temperature in K) is not larger than a few 
electron volts (eV) (see Table 8.1). 



2.2 Determination of the Mass 

Beginning with Dalton'^ law of constant and multiple proportions, and Avogadro's 
hypothesis, according to which equal volumes of gas contain the same number of mole- 
cules or atoms, we introduce relative atomic masses (also called atomic weights) A^^x. It 
was first discovered with the methods of chemistry that these atomic weights are ap- 
proximately whole-number multiples of the atomic mass of the hydrogen atom. The 
relative atomic masses of nitrogen and oxygen are then ^rei(N) — 14, ^rei(O) — 16. 

For this reason, an atomic mass unit has been defined, 1 u (abbreviation for unit, 
previously also referred to as 1 amu), which is approximately equal to the mass of a 
hydrogen atom. Since 1961 the unit of atomic mass has been based on the carbon atom 
with ^rei= 12.00000 u and is thus no longer exactly equal to the mass of the H 
atom. The use of C as base substance was found to be expedient for the experimental 
precision determination of atomic masses by chemical means. We have as definition 

lu = l/12of the mass of a neutral carbon atom with nuclear charge 6 

and mass number 12, i.e., . (2.1) 

Earlier scales were defined somewhat differently: the old “chemical” scale was based 
on oxygen in the naturally occurring isotope mixture: 

1 amUchern =1/16 (average mass of O atoms in the natural isotopic mixture) 

and the old “physical” scale was based on the oxygen isotope 

1 amu i6q =1/16 (mass of an atom) . 

The following conversion formulae hold: 



amuchern • amui6o : = 0.99996 : 0.99968 : 1 .00000 



( 2 . 2 ) 
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and 

^rei,chem:^rei,>«o:^rei,>2c= 1-00004: 1 .00032 : 1 .00000 . (2.3) 

The absolute atomic masses can be obtained from the relative masses using the concept 
of the mole. 

1 mole of a substance is, according to Avogadro, as many grams as the relative 
atomic weight (in the case of molecules, as the correspondingly defined relative molec- 
ular weight). Thus, 1 mole of the carbon isotope ’gC is 12 grams. 1 mole of any sub- 
stance contains the same number (Np^) of atoms (molecules). 

The number Np^ which is defined in this way is called Avogadro’s number (in the 
German literature, it is called the Loschmidt number after the Austrian physicist Lo- 
schmidt who determined it in 1865 by measurements on gases). Experimental methods 
for its determination will be discussed in the following section. 

The absolute atomic mass /Watom therefore be obtained by measuring 
Avogadro’s number. We have: 



Mass of an atom — 



Mass of 1 mole of the substance 



(2.4) 



The determination of atomic masses is thus based on the determination of Avogadro’s 
number; the size of the latter depends evidently on the choice of the base substance for 
the mole. is currently defined as the number of carbon atoms in 12.000 g of iso- 
topically pure *gC. 

The present best value for Np^ is 

Np, = (6.022045 ± 0.000005) • 1 0^^ mole “ ‘ . 



With this value, we can write (2.4) in the form 



Watom = [gram] . 

Na 



(2.5) 



For the conversion of the mass unit u into other units the following relations hold: 

1 u = (1.660565 ± 0.000005) • 10“^'^kg = 931.478 . (2.6) 

This last conversion results from the mass-energy equivalence £ = mc^. MeV is a meas- 
ure of energy (see Table 8.1), c is the velocity of light. Numerical values for masses m, 
relative atomic masses A^i, and the mass number >1 of a few atoms are shown in Table 
2 . 1 . 



Table 2.1. Mass number, mass, and relative atomic mass of several atoms 





Mass number 
A 


Mass m 
[kg] 


^rel 


H atom 


1 


1.67342-10-2'' 


1.007825 


C atom 


12 


19.92516-10-2'' 


12.000000 


O atom 


16 


26.5584:10-2' 


15.99491 
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The mass number A of an atom is the integer which is closest to its relative atomic 
mass Are[. It is found in nuclear physics that A is equal to the number of nucleons 
(protons and neutrons) in the atomic nucleus. 



2.3 Methods for Determining Avogadro’s Number 

2.3.1 Electrolysis 

In electrolytic decomposition of salts from a solution, the amount of salt decomposed 
is proportional to the charge which flows through the electrolyte. For one mole of a 
monovalent substance, a charge of 96485 As (ampere-seconds) is required. This is the 
Faraday constant F. Thus, since each ion carries one elementary charge e, we have the 
relation TVa = F/e. The elementary charge e denotes the charge on a single electron (see 
Sect. 6.3). For example, in order to electrodeposit one mole or 63.5 g of copper from a 
solution of CUSO 4 in water, 2 Aa electrons are necessary, since the copper ion is doubly 
positively charged. By weighing the amount of material deposited and measuring the 
electric current as well as the time, one can obtain the constant 

2.3.2 The Gas Constant and Boltzmann’s Constant 

The universal gas constant R and Boltzmann’s constant k are related through the equa- 
tion k = R/Nj!^ . 

The gas constant can be determined by means of the ideal-gas law pV = RT; the 
Boltzmann constant, for example, from sedimentation equilibria {Perrin, 1908). In the 
latter method, the density distribution of small suspended particles in a liquid, deter- 
mined by the simultaneous action of gravity and the Brownian molecular motion, is 
given by the equation 

= (2.7) 

where is the number of particles in a unit volume at a height h, Hq the number of par- 
ticles in a unit volume at height h = 0, m the mass of the particles, g the acceleration of 
gravity, k the Boltzmann constant, and T the absolute temperature. In Fig. 2.1 , a model 




Fig. 2.1. Sedimentation equilibrium: distribution of suspended mastix spheres 
of 0.6 pm diameter at four different heights in the field of view of a micro- 
scope (after Perrin) 




2. The Mass and Size of the Atom 



experiment demonstrating sedimentation is shown. The formula given by (2.7) is a 
special case of the famous Boltzmann distribution. Since we will use this distribution in 
numerous places in this book, we will give an explicit general formula for it here. The 
exact derivation may be found in texts on statistical physics. According to the Boltz- 
mann distribution, the number of particles with energies in the interval E ... E + dE in 
thermal equilibrium is given by 

n^dE = NZ(T) e dE , (2.8) 

where, generalising (2.7), the following quantities appear: Ais the total number of par- 
ticles, Z{T) is the partition function. The latter ensures that, integrating over the whole 
energy range, one obtains the total particle number N, i.e., j n^dE = N. It is therefore 
given by Z(T)~^ = \ E/kT) g{E)dE. Finally, g{E) is the density of states^ it is 
necessary since, for example, particles with the same energy can be moving in different 
directions, i.e., there can be more than one state with the energy E. 

A completely satisfactory definition of g{E) only becomes possible with the help of 
quantum mechanics. Using quantum numbers, of which we will later encounter a num- 
ber of examples, one can count the number of “states” in the interval E ... E-\-dE. 



2.3.3 X-Ray Diffraction in Crystals 

With x-radiation of a known wavelength, one can determine the lattice constant, or the 
volume of an atom or molecule in a crystal. The volume of a mole Fmoi is then Np^ times 
the atomic volume. For one mole one thus has 



^A^atom^ Vmol = M/g, (2.9) 

where M denotes the molar mass and g the mass density. 

Figure 2.2 illustrates the principle; it shows a section of a NaCl lattice. NaCl crystal- 
lises in the face-centred cubic structure. The NaCl lattice can be built up from two face- 
centred cubic lattices containing the Na"^ and the CP ions. These ions occupy the 
corners of cubes of side a/2, where a is the edge length of the Na^ or CP unit cell. The 
unit cell is the smallest unit of a crystal, in the sense that the crystal structure consists of 
a repetition of this element in each of the three dimensions of space. 

The size of a can be determined by x-ray diffraction if the x-ray wavelength is 
known (Sect. 2.4.5). In a cube of volume (a/2)^, there are 4/8 = 1/2 NaCl molecules, 
since each ion belongs to 8 cubes. The number of molecules per unit volume is therefore 






Fig. 2.2. Section of a NaCl lattice. The unit cell of the face-centred 
cubic lattice is a cube of side a. It contains one face-centred cubic cell 
each of Na'*' and of CP ions 
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« = (l/2)(2/fl)^ (2.10) 

When we set this equal to the quotient N^/ F^oi = q/M, we obtain 



N.= 



AM 

Q 



4-58.4 
(5.63)^- 10"^^ 



6.05-10^^(kmol)~' with 



-1 



= 6 . 05 - 10 ^^ 

•2.16 

a = 5.63 • 10“^cm 



mol ^ 
and 



or 

Q = 2.16 gem 



( 2 . 11 ) 



The accuracy of a measurement of by this method is about 5-10“^ (relative uncer- 
tainty). The density q cannot, up to now, be determined with greater accuracy. How- 
ever, the lattice constant a can be obtained with much greater accuracy using an x-ray 
interferometer of Si single crystals, resulting in a relative error of 6 • 10“^. This method 
becomes an absolute technique for determining Np^ if the measurement of the x-ray 
wavelength is made using a mechanically ruled grating and can thus be related to the 
meter directly. This becomes possible with the method of grazing-incidence diffraction; 
in the normal-incidence method, the mechanically prepared rulings are too wide 
relative to the wavelength. 



2.3.4 Determination Using Radioactive Decay 

Among the many other methods with which Np^ has been determined, we will only men- 
tion here that of Rutherford and Royds from the year 1909. 

In the experimental setup shown in Fig. 2.3, a radon source is contained in the in- 
terior of the glass tube A. The a particles which are emitted by this source can pass 
through the thin walls of tube A. In the second, thick-walled tube B, the a particles. 




Fig. 2.3. Experimental arrangement of Rutherford and Royds: 
Phil. Mag. 17, 281 (1909). The thin-walled glass tube A contains the 
or-active gas radon, ^g^Rn. The helium atoms which collect after 
some days in the evacuated space B are compressed into the capil- 
lary C and detected in the spectrum of a gas discharge. The mercury 
levelling vessels serve to compress the gases 
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which are the atomic nuclei of helium atoms, collect as He gas. Through ionisation pro- 
cesses in the source, in the glass walls, and in the gas, electrons are set free which can 
combine with the a particles and make them into He atoms. Ignition of a gas discharge 
in tube C excites these atoms and causes them to emit light; spectral analysis shows that 
the gas is, in fact, helium. 

In this manner, it was demonstrated that a particles are helium nuclei. If one meas- 
ures the quantity of gas which is formed in a certain time, and knows the decay rate of 
the source (e.g., by counting with a Geiger counter or scintillation detector), one can 
determine the number of atoms per unit volume and thus 



2.4 Determination of the Size of the Atom 

2.4.1 Application of the Kinetic Theory of Gases 

The kinetic theory of gases describes the macroscopic state variables of gases such as 
pressure and temperature on an atomic basis. Its application to the explanation of the 
macroscopically measurable quantities relevant to gases also leads to a determination 
of the size of the atoms. To understand this, we must first recall the arguments which 
provide convincing evidence for the correctness of the kinetic theory. 

The ideal-gas law states 



pV=nRT, (2.12) 

where p is the pressure, V the volume, n the number of moles, R the universal gas con- 
stant, and T the temperature. 

At constant temperature, this is Boyle’s law. Equation (2.12) can also be derived 
kinetically. To do this, one calculates the number of particles in a given volume which 
collide with a unit surface of the walls per unit time and thereby transfer momentum to 
the walls (this is the number of particles contained in the so-called Maxwellian cylinder 
of length v). The pressure which is exerted by the gas on the walls is then given by 

p = ii/3)Nmv^, (2.13) 

where m is the mass of the particles (gas atoms or molecules), is their mean-square 
velocity, and N is the number of particles per unit volume. Since the mean kinetic 
energy mv^/2 of a free particle in thermal equilibrium is equal to {3/2)kT, (2.13) be- 
comes p = NkT. This equation is identical to the ideal-gas law, as one immediately re- 
cognises upon multiplication by the molar volume V^aoh 



pV^^i = NV^^ikT = N^kT = RT. (2.14) 

The demonstration that the kinetic theory gives a good description of the physical be- 
haviour of gases is provided by experimental testing of the predictions of the theory. 
For example, the distribution of the molecular velocities in a gas which can be derived 
from the kinetic theory (Maxwell distribution) has been experimentally verified with 
great accuracy. This distribution is again a special case of the Boltzmann distribution 
[cf. (2.8)]. Here the energy of a particle isE = mv^/2. We wish to calculate the number 
of particles, n{v)dv, whose absolute velocity, independent of direction, lies in the 
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interval v ... v + dv. Thus we must recalculate the density function g(E) in terms of a 
new density function g(v), using the condition 

g{E)dE = g(v)dv . 



Since the calculations yield no physical insights, we will only give the end result, the 
Maxwellian velocity distribution: 



n(v)dv = riov 



m 

~kf 



3/2 

^-mvVlkT^^ 



(2.15) 



with n(v)dv being the number of particles with a velocity in the interval v v + dv 
and «o the total number of particles. In the experimental test of the velocity distribu- 
tion, the relative number of gas atoms with a given velocity v is measured. 



2.4.2 The Interaction Cross Section 

The size of an atom in a gas may be measured from the interaction cross section with 
which the atom collides with other atoms. The derivation of the concept interaction 
cross section is illustrated in Fig. 2.4. A beam of atoms of type 1 (beam cross-sectional 
area particle radius r ^ , particle number density Nq) strikes a layer made of atoms of 
type 2 (layer thickness Ax, particle radius a* 2 , particle number density n). We ask, 
“How many atoms of type 1 collide with those of type 2 and are deflected from their 
course, so that they do not pass undisturbed through the layer?” This interaction cross 
section is thus frequently referred to in physical language as a scattering cross section. 



©- 



— I- 



- 0 = 






Q 



N(x) N(x+Ax) 




Fig. 2.4. Definition and measurement of interaction 
cross sections. Upper part: The interaction cross sec- 
tion of particles with radius /*j with those of radius a *2 
is found to be {r^^-r 2 f‘n. Centre part: Out of Nq 
particles which reach the volume element of thick- 
ness Ax from the left, N pass through undeflected. 
In this manner, the interaction cross section may be 
experimentally determined. Lower part: Derivation 
of (2.20). The radii of particles 1 and 2 are combined 
into the radius {r^ -i- ^ 2 ) 
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The problem corresponds roughly to the following macroscopic situation: someone 
is standing in front of a forest and shoots a bullet from a rifle. The probability that the 
bullet will pass through the forest undeflected is larger, the smaller the thickness and 
the density of the trees. If one shoots repeatedly and counts the number of undeflected 
bullets relative to the total number fired, it is possible to determine the thickness of the 
trees (that is, their interaction cross section). In order to do so, one must know their 
density. Naturally, one would not use such a method for a real forest. In atomic 
physics, it is, however, necessary, since it is not possible to put a meter stick up to an 
atom as it is to a tree. The same problem occurs in a number of contexts in physics: for 
example, in nuclear physics, a collision between two particles can be used to determine 
the interaction cross section for a nuclear or particle reaction. In atomic physics, we 
shall see that the interaction between a light quantum and an atom is described by a 
cross section. Because of its wide application in many areas of physics, the concept of 
the interaction cross section will be treated in some detail here. 

A collision between atoms of radii and leads to a deflection of the atoms out of 
their initial directions when it occurs within an area a = (ri + r 2 )^n (see Fig. 2.4). We 
may thus combine the deflection of both colliding particles into a common cross 
section. The probability of a collision is then given as the quotient of the number of 
favorable to the number of possible cases: 



IV = 



Area of all the interaction cross sections in the volume of the beam 

Total area A 



This is valid under the assumption that the areas nr^ of various particles which are 
located behind one another do not overlap. This is fulfilled for a sufficiently small layer 
thickness. In order to calculate the number of deflected atoms in a finite layer of thick- 
ness L, we first divide up the layer into thin layers of thickness Ax, If atoms enter a 
thin layer at the position x (see Fig. 2.4), a number zlAis deflected out of the beam 
after passing through the distance Ax: 

^ ^ _ Total number of atoms in the volume • cr ^ ^ 

AN=-WN=- N, (2.16) 

Total area 



Since the total number of atoms in a given volume is given by the product of particle 
number density n with the area A and the layer thickness Ax, we obtain from (2.16) 



^ _ nAAxa , . 
AN= N, 



(2.17) 



If we replace differences by the corresponding infinitesimal quantities, we have 



dN/N= -nodx, (2.18) 

To obtain the number of atoms which are deflected (or not deflected, respectively) 
along the entire length x, we integrate (2.18): 

lnA= CTX + In Aq. (2.19) 

Here, InAo is a constant of integration, with Aq being the number of particles which 
are incident at the point x=0. From this relation we obtain immediately 
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N = NoCxp( - n ox) as the number of particles which are still present after a distance x, 
or, after passing through a total length L 

N = Noe-"^^. ( 2 . 20 ) 

The number of deflected atoms is correspondingly 

^scatt = ^o(l-e“"'"^). (2.21) 

The product na= a is also denoted as the (macroscopic) scattering coefficient and a as 
the (microscopic) total interaction cross section. 

From a measurement of a follows, according to o = {rx~\- rif'n, the quantity 
( a*i + ri). In the case of identical atoms with r = r^ = a * 2 , we have thus determined r, i.e., 
the size of the atoms. 
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Fig. 2.5. Attenuation of a light beam on 
passing through an absorbing medium 



We will frequently encounter the concept of the interaction cross section, which we 
have defined here, in later sections of this book. Figure 2.5 shows the dependence of 
the intensity of a light beam on the thickness of absorbing medium through which the 
beam has passed, as described by (2.20). For the absorption of light by atoms or mole- 
cules, the Lambert-Beers law is valid: 

/ = /oe"'^“^, (2.22) 

where / is the transmitted intensity, /q the incident intensity, and a the absorption 
coefficient per absorbing particle, n is again the number density of atoms or molecules 
in the absorbing medium. 
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2.4.3 Experimental Determination of Interaction Cross Sections 

Interaction cross sections can be directly measured by collision experiments using an 
atomic beam and a gas target. An apparatus for such measurements is shown in Fig. 
2 . 6 . 



Gas 




Pump 



Fig. 2.6. Setup for measuring scattering cross sections of atoms by atoms. A beam of gas atoms enters the 
scattering chamber through the slits S. Scattering by the gas atoms in the chamber leads to an attenuation of 
the beam which arrives at the catcher 



Frequently, however, interaction cross sections or atomic sizes are determined indi- 
rectly. For example, one measures the mean free path A, which we define with the help 
of (2.20): A is the distance L or x, after which the initial density Nq has been reduced to 
Nq/q. Thus, with (2.20), where n is again the particle number density, and taking 
rx = r2 = /*, 

1 1 

A = ^ = . (2.23) 

4nr n an 

Up to now, we have assumed that the target atoms are at rest. If they are also in 
motion, the expression for A must be modified somewhat. We give the result without 
derivation: 

A = . (2.24) 

4n]/lr^n 

The mean free path thus defined enters into macroscopically measurable quantities, 
for example the viscosity r}. The viscosity is in fact a measure of the momentum trans- 
fer between atoms or molecules in gases or liquids and therefore also depends on the 
frequency of collisions between the particles. The mean free path can thus also be 
macroscopically determined. The detailed relation is (without derivation) 

r} = \Qkv, (2.25) 

where rj is the viscosity, q the density, A the mean free path, and v the mean velocity of 
the particles. The quantity r\ can be measured, e.g., from the flow velocity through a 
capillary. 
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Another method for measuring A results from thermal conductivity in gases. The 
latter also depends on the frequency of collisions between the particles and the energy 
transfer which thus occurs. Here we have - likewise without derivation - a relation 
between the heat flow dQ/dt and the thermal gradient dT/dx which produces it: 

dQ/dt = - Atc • -4 • dT/dx , (2.26) 

where dQ is the differential quantity of heat, Atc the thermal conductivity, and A the 
cross-sectional area of the heat transport medium. The thermal conductivity Atc 
depends upon the mean free path according to the relation 

(2.27) 

Table 2.2 on p. 20 contains some values for atomic radii. Further details and the deriva- 
tions which we have passed over here may be found in standard texts on experimental 
physics. 



2.4.4 Determining the Atomic Size from the Covolume 

The Van der Waals equation for one mole of a real gas states 

{P+a/V^){V-b) = RT. (2.28) 

Here the expression a/V^ denotes the “internal pressure” which adds to the external 
pressure P and is due to the forces between the particles. Another correction due to the 
internal forces is the reduction of the free volume V of the gas by the volume of the 
gas particles (the so-called covolume). This quantity b, which can be experimentally 
determined by measuring the P-V diagram of the equation of state, is equal to the 
fourfold volume of the particles. We thus have 

b = ^-^- P -Nf,. (2.29) 



2.4.5 Atomic Sizes from X-Ray Diffraction Measurements on Crystals 

The famous experiment of von Lane, Friedrich, and Knipping in 1912 on the diffrac- 
tion of x-radiation in crystals yielded: 

- the final proof that crystals are built up of atoms, 

- the wavelength of x-radiation, 

- the lattice constant of crystals - and with it, information on the size of the atoms in 

the crystal. 

Figure 2.7 shows the experimental set-up schematically. For an exact derivation of 
the interference conditions, one would have to treat the interference from a three-di- 
mensional lattice. Here we will use the simplified method of Bragg (1913) to illustrate 
how the lattice constants can be determined. 

X-ray diffraction may be regarded as a reflection of x-radiation by the so-called lat- 
tice planes at certain specular angles. A lattice plane is a plane in a crystal which is oc- 
cupied by atoms. In a crystal there is a large number of families of parallel and equi- 




16 



2. The Mass and Size of the Atom 




Fig. 2.7. X-ray diffraction from a single crystal after von Laue; schematic of the experimental arrangement. 
X-radiation with a continuous distribution of wavelengths (polychromatic or white x-radiation) is diffracted 
by a single crystal. The conditions for interference from a three-dimensional lattice yield constructive 
interference at particular directions in space and at particular wavelengths. One thus observes interference 
maxima, which correspond to certain discrete wavelengths (monochromatic x-radiation) 



(110) (310) 




( 010 ) 



( 120 ) 



Fig. 2.8. Simple cubic lattice with several lattice planes. These are characterised by the Miller Indices. The 
spacing between two parallel lattice planes decreases with increasing Miller indices 





Fig. 2.9 a, b. Derivation of the Bragg Law of Re- 
flection. The horizontal lines symbolise lattice 
planes, from which the incident x-radiation ar- 
riving at angle 6 is scattered, a) Each atom of a 
lattice plane acts as a scattering centre, b) The 
derivation of the Bragg condition for the reflec- 
tion of x-radiation from a lattice plane 
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distant lattice planes. They are distinguished from one another by their spacing, by the 
density of atoms within the planes, and by their orientations within the crystal lattice 
(see Fig. 2.8). According to Huygens’ principle, each atom which is struck by the in- 
cident x-radiation acts as the source point for a new elementary wave (Fig. 2.9a). These 
elementary waves produce constructive interferences at certain angles. The reflection 
condition is derived as follows: amplification occurs when the path difference A be- 
tween two adjacent beams corresponds to a whole multiple of the wavelength, nX. For 
the path difference A we have, according to Fig. 2.9b, 

J = AB + BC-AE = 2AB-AE = ^^-2ADcos0. (2.30) 

sin 9 

With the relation AD = rf/tan0, one obtains from (2.30) 

/\ = 2 - ^ (1-cos^^) , 
sin 9 

or, finally, the condition for constructive interference 

A = 2dsin9= nX . (2.31) 

The various methods of observing x-ray diffraction from crystals which are used in 
practice differ in the following ways: 

- In the Laue method one uses a single crystal, a particular value of the angle of inci- 
dence, and x-radiation with a continuous spectrum (“polychromatic” x-rays). The 
condition for constructive interference is fulfilled for individual points in the plane 
of observation for particular wavelengths. 

- In the Bragg rotating-crystal method one also uses a single crystal, but mono- 
chromatic x-rays. The crystal is rotated, so that the angle of incidence varies 
through a continuous range of values. The condition for constructive interference is 
fulfilled for various lattice planes successively. 

- In the Debye-Scherrer method (Figs. 2.10, 11), the sample is polycrystalline or 
powdered. The x-rays are monochromatic. Since each lattice plane occurs in all pos- 
sible orientations relative to the incident beam, one obtains interference cones 
whose intersection with the plane of observation gives interference rings. 

Equation (2.31) relates the wavelength of the x-rays to the lattice constant or the 

spacing of the lattice planes. The x-ray wavelength can be measured by other means 
than with crystal interferences. Its measurement can be directly correlated to the meter 
by utilising x-ray interference at grazing incidence from a diffraction grating. Since it is 
not possible to manufacture diffraction gratings with a grating constant of the order of 
x-ray wavelengths, one uses coarse gratings, for example with 50 lines/mm, and lets the 
x-radiation strike the grating at a grazing angle of less than 1 ° . Since the index of 
refraction of x-rays is somewhat smaller than 1 , total reflection occurs when the angle 
of incidence is sufficiently small. The effective grating constant is then the projection 
of the actual line spacing at this angle. It is sufficiently small to permit the measure- 
ment of the x-ray wavelength. 

We make two additional remarks concerning x-ray diffraction. 

- In practice, x-ray diffraction is much more complicated than indicated above. The 
exact intensity distribution in the diffraction pattern must be carefully determined, 
and account must be taken of the fact that the scattering centres are not points, but 
instead are electronic shells with a finite extension. A complete, quantitative ana- 
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Fig. 2.10. Debye-Scherrer method: x-ray 
diffraction of monochromatic x-radia- 
tion by a polycrystalline sample Z. On 
the film, the intersections of the diffrac- 
tion cones from the various families of 
lattice planes appear as rings. To 
generate monochromatic x-radiation, 
one uses either the socalled characteristic 
x-ray lines (Fig. 18.3), or a single crystal 
may be employed as a monochromator 
according to the principle of (2.31) 




Fig. 2.11. Debye-Scherrer diagram of MgO [from Gerthsen, Kneser, Vogel: Physik, 13th ed. (Springer, Ber- 
lin, Heidelberg, New York 1978) Fig. 12.37] 



lysis of x-ray diffraction patterns leads finally to an exact determination of the elec- 
tron density with the sample crystal lattice. From it, we obtain not only the spacing 
between the atoms in the lattice, but also their sizes and even their shapes. Figures 
2.12 and 2.13 illustrate experimentally determined electron density distributions in 
crystals. A contour map of this type raises the question, “Where does an atom have 
its boundary?”, and this leads in turn to the question, 

“What do we really mean by the size of an atom?” 

- In the case of hard spheres, the size can be defined exactly. For atoms, the concept 
“size” cannot be defined without reference to the method of measurement. Various 
methods are sensitive to different properties of the atom, which depend on the “size 
of the atom” in differing ways. 

Let us consider the methods of investigation described above once more in light of 
this remark. 

From the viscosity rj one obtains a measure of the interatomic distance in the 
presence of thermal motion. Because the atoms are not perfectly hard spheres, the 
radius determined in this manner will, however, be a function of the velocity. Further- 
more, the results depend on the shape of the atom; the spatial extension of the elec- 
tronic shells of atoms and molecules deviates more or less strongly from a spherical 
shape, in general. 

The covolume b in the real-gas law is derived under the assumption that the atoms 
are elastic spheres. The lattice plane spacing d measures an equilibrium distance 
between the particles in the crystal lattice. 
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Fig, 2,12, The electron density 
distribution in the anthracene 
molecule determined by V, L. 
Sinclair, J. M, Robertson, A. 
McL, Mathieson: Acta Crystal- 
logr. 3, 254 (1950). Lines of equal 
electron density (“contour lines”) 
are plotted 














1 





Fig. 2.13. Distribution of the electron 
density in the basal plane of NaCl from 
x-ray analysis by Schoknecht: Z. Natur- 
forsch. 12a, 983 (1957). The solid curves 
are again lines of equal electron density 
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It should therefore not be surprising that the values of atomic radii measured by the 
various methods deviate somewhat from each other - the order of magnitude is, 
however, always the same - a few Angstroms. Table 2.2 shows a comparison of the 
measured values. 



Table 2.2. Atomic radii [A], measured by various methods (1 A = 0.1 nm) 





from rj 


from d 


from b 


Neon 


1.18 


1.60 


1.2 


Argon 


1.44 


1.90 


1.48 


Krypton 


1.58 


1.97 


1.58 


Xenon 


1.75 


2.20 


1.72 



2.4.6 Can Individual Atoms Be Seen? 

The resolving power of a microscope is defined as the smallest spacing between two 
structures in an object which can still be imaged separately. According to Abbe’s 
theory of image formation, the resolving power is limited by diffraction in the opening 
of the lens which forms the image. In texts on optics, the condition that - in addition 
to the zeroth order — at least one additional diffraction maximum is necessary in order 
to form an image, is used to derive the equation for resolving power, 

d = — T — . (2.32) 

rtsma 

where d is the resolving power, X the wavelength, n the index of refraction, and a the 
angular opening of the lens. For visible light, one obtains a resolution of ca. 5000 A or 
500 nm. 

For other types of electromagnetic radiation, the theoretical resolving power cannot 
be reached. For x-rays, it is not possible to construct suitable lenses, since the index of 
refraction of all substances for x-radiation is approximately equal to 1 . Electrons may 
be deflected by electric and by magnetic fields; thus, they may be used to construct 
lenses for electrons and to form images. Because of the unavoidable “lens 
aberrations”, however, it is only possible to work with beams of very small divergence 
in electron microscopes. Table 2.3 gives an overview of the resolving powers of various 
methods of image formation. 



Table 2.3. Resolving powers for various wavelengths 





Resolving Power [A] 




Remarks 




theory 


practice 




Light 
Dark field 

X-rays (A = 1 A) 

Electrons 

(100000 Volt ^ A = 0.037 A) 


ca. 5000 
ca. 500 

1 

0.04 


ca. 5000 
ca. 500 

several 100 
0.7 


No image formation, 
only diffraction pattern 
No lenses 
Lens aberrations 
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In recent years, great success in the effort to form images of individual atoms has 
been obtained with special types of electron microscopes: with the field emission micro- 
scope it has been possible to visualize single atoms or large molecules on the tips of 
fine metal points (Fig. 2.14), and with the scanning electron microscope it has proved 
possible to form images of atoms and molecules. Here the attainable resolution is 
about 5 A or 0.5 nm. With high-voltage electron microscopes, one can now obtain a 
resolution of 0.15 nm. This makes it possible to image the individual atoms in mole- 
cules and in crystals. An example is shown in Fig. 2.15. 

Finally, using the scanning tunnel microscope, it is possible to resolve the atomic 
or molecular structure of surfaces and to make the individual structural elements visi- 
ble. In this apparatus, which was developed by Binnig and Rohrer, an extremely fine 
metal point is moved over the surface to be observed at a distance of less than 1 nm. 




Fig. 2.14. Image of the point of a tungsten needle with a field emission microscope, developed 
by F. W. Muller. The picture was provided by the Leybold-Heraeus Co. in Cologne. The 
image can easily be produced as a lecture demonstration. The various lattice planes which 
intersect the tip of the needle have differing emission probabilities for the field emission of 
electrons; this produces the geometric pattern of light and dark regions. This pattern reflects 
the crystal lattice geometry. Barium atoms were then vapour-deposited onto the tungsten 
needle. Where they are present, the emission probability is increased, so that they appear as 
bright points in the image. During the vapour deposition one can observe them falling onto 
the point like snowflakes. It can be shown in this manner that individual atoms become visible 








Jl 1 €#> t ^ 



Fig. 2.15. An electron microscope picture of hexa-deca- 
chloro-copper-phthalocyanin molecules. The molecules 
were produced as a ca. 10 monolayer thick, crystalline 
growth-layer on the alkali halide crystal which serves as 
substrate. The image formation and processing were done 
with a 500 kV high-resolution electron microscope and 
with a special image enhancement technique. The central 
copper atoms are especially clear, as are the 16 peripheral 
chlorine atoms. (The picture was kindly placed at our dis- 
posal by Prof. N. Uyeda, Kyoto University) 
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The tunnel current between the substrate and the point is a measure of the distance, 
of the charge density, and of the electrical work function for charges leaving the 
substrate. These quantities change from place to place on an atomic scale; thus, one 
can obtain a picture of the substrate surface. In Fig. 2.16, the atomic structure of 
single molecules of copper phthalocyanine is shown. 

The imaging and manipulation of individual atoms or molecules on solid surfaces 
has become possible with this instrument. More details are given in Haken and Wolf, 
Molecular Physics and Elements of Quantum Chemistry, Sect. 2.1. 




Fig. 2.16. A picture obtained with the 
scanning tunnel microscope, showing 
copper phthalocyanine molecules on a 
copper substrate surface. From P.H. 
Lippel, R.J. Wilson et al., Phys. Rev. 
Lett. 62, 171 (1989). At the lower right, 
the calculated charge density 2 A above 
the molecular plane is shown 



Individual atoms can thus in fact be made visible. Much older are methods of ex- 
perimentally observing processes which involve single atoms. 

Single atomic processes (nuclear decays) were made visible in the first years of re- 
search into radioactive decay by using the ''spinthariscope” (Fig. 2.17). This is nothing 
more than a fluorescent screen, which produces light flashes upon bombardment with 
decay products from radioactive material and which may be observed with a 
magnifying lens. With this instrument, single atomic events - decays - were counted 
in Rutherford’s laboratory at the beginning of this century. Today, scintillation 
detectors or semiconductor detectors are used for this purpose. 

A scintillation detector operates according to the following principles: when radia- 
tion from a radioactive decay falls on a Nal crystal, the crystal produces light flashes, 
which can be amplified in a photomultiplier tube (PMT) (Fig. 2.18). In this way, in- 
dividual events can be conveniently registered. For example: one electron with an 
energy of 10000 eV produces ca. 200 light quanta in the scintillator (it requires on aver- 
age about 50 eV per light quantum). Each light quantum creates one photoelectron at 
the photocathode of the PMT. The PMT amplifies each of these electrons about 
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Fig. 2.17. Spinthariscope, schematic illustration. 
The fluorescent screen scintillates due to the irradia- 
tion from the radioactive source. The scintillation 
processes may be observed through the magnifying 
lens 



Fig. 2.18. Schematic diagram of a scintillation detec- ► 
tor. The light flashes which are produced in the scin- 
tillator by the incident radiation pass through the 
light guide to the photocathode of an electron multi- 
plier tube. The photoelectrons which are released by 
the cathode are amplified in a series of steps and 
registered at the anode as a current pulse 



+ 







Scintillator 



Light guide 
Photocathode 



L Electrodes 
f ("dy nodes") 



Anode 



Counting 

electronics 



10^-fold, so that per light quantum, about 10^ electrons are released from the anode. 
This results in a charge per incident electron (beta particle) of which can 

easily be measured. 

Individual ions can be caught in an electromagnetic ion trap, stored there, and spec- 
troscopically studied as isolated particles. An ion trap is shown in Fig. 2.19. It localizes 
the particle by making use of inhomogeneous electromagnetic fields of suitable geome- 
try (quadrupole field). The localized ion can be excited to luminescence with narrow- 
band laser light; Fig. 2.20 shows an example. Thus, the interaction between the radia- 
tion field and individual ions can be investigated. Such studies are also the goal in the 
development of the one-atom maser, as described by H. Walt her (e.g. H. Walther: 
Europhysics News 19, 105 (1988)). Here, it is possible to observe the energy exchange 
between an individual atom and the electromagnetic field in a resonant cavity. 

An arrangement which played an especially important role in the early period of 
modern atomic physics and which is still in use today for the excitation of atoms and 
for producing particle beams is the gas discharge tube. Fig. 2.21. It can be employed 
both for exciting the emission of light from the atoms of the gas inside the tube and for 
the production of cathode and canal rays. Plucker described cathode rays for the first 
time in 1859. They were given that name because they could be observed through a hole 
in the anode and seemed to emanate from the cathode. In fact, they are generated in the 
gas volume. The rays which strike the fluorescent screen in front of the hole in the 
cathode, the canal rays, were discovered in 1886 by Goldstein, In 1897, Thomson show- 
ed that the cathode rays consists of negatively charged particles - the electrons. Wien 
demonstrated in 1900 that the canal rays are electrically charged atoms, that is, ions. 
They also are formed in the region of the gas discharge through collisions with charged 
particles which have been accelerated by the electric field. 

Atoms as the basic particles of matter have been thus theoretically and experimen- 
tally detected and made visible in the course of the past century. 
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Fig. 2.19. Schematic view of an electrodynamic ion trap (Paul 
trap). It is made of a ring with a hyperbolic surface and two 
hyperbolic caps. A high-frequency electric field maintains the 
ions on small circular orbits; the field vanishes at the center 
point. From P.E. Toschek, The Art of Measurement (VCH, 
Weinheim 1988). More information can also be found in F. 
Diedrich and H. Walther: Phys. Rev. Lett. 58, 203 (1987). See 
also the Nobel prize lecture of W. Paul: Angew. Chem. Inti. 
Ed. 29, 739 (1990) 




Fig. 2.20. Imaging of individual atoms. In the center of 
an ion trap, a barium ion has been localized and excited 
by laser light to resonance fluorescence; it becomes visi- 
ble as a luminous spot. From P.E. Toschek and W. 
Neuhauser: Atomic Physics 7, ed. by D. Kleppner and 
F.M. Pipkin (Plenum, New York 1981) 
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rays rays 




Pump 



Fig. 2.21. A gas discharge tube for producing cathode rays. 
Between the cathode and the anode is a potential difference 
of several thousand volts. The fluorescence which appears 
in the tube and the formation of cathode rays depend 
strongly on the gas pressure. In the field-free region between 
the cathode and the fluorescent screen one observes the 
canal beam, which consists of positive ions. The cathode 
and canal rays produced in this manner were particularly 
important in the early period of atomic physics for the in- 
vestigation of charge, mass, and scattering cross sections of 
electrons and ions 
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Problems 



2.1 a) Colloidal particles are dispersed in a liquid. Show that at equilibrium, the num- 
ber of particles per cm^ is given by a Boltzmann distribution: 



n{h) = «o exp- 



-AV{Q-Q')gh 

RT 



where Np^ is Avogadro’s number, R is the gas constant, T is the absolute temperature, 
Kis the volume of the particles, q is the density of the particles, q’ is the density of the 
liquid, g is the acceleration due to gravity, h is the height and «o is the number of par- 
ticles at the height h = 

b) Determine Avogadro’s number, using the above relation and the following ex- 
perimental data: 

riQ = 134 particles/cm^ n{h = 0.0030 cm) = 67 particles/cm^ q = 1.23 g/cm^ q' = 
1.00 g/cm^ T = 293 K, particle diameter = 4.24* 10"^ cm. 

Hint: To derive an expression for n{h), use the barometric altitude formula: 
dp = -Q{h)gdh. Treat the particles as heavy, non-interacting molecules of an ideal 
gas and use the ideal gas equation = RTio obtain the relation between dp and dg or 
dn. 



2.2 Liquid helium (atomic weight 4.003) has a density ^ = 0.13 g/cm^ Estimate the 
radius of a He atom, assuming that the atoms are packed in the densest possible confi- 
guration, which fills 74% of the space. 

2.3 Canal rays, i.e., positive ion rays are generated in a gas discharge tube. How often 
does an ion {r = 0.05 nm) collide with an atom of the ideal filler gas (r = 0.1 nm) if it 
travels 1 m in a straight path through the discharge tube and if the pressure in the tube 
is 1 mbar? 10“^mbar? lO^'^mbar and the temperature T= 300K? (1 mbar corre- 
sponds to 10^ Pa). 

Hint: The ions do not have a Maxwell-Boltzmann velocity distribution. All the particles 
are assumed to have the same velocity. 

2.4 The covolume of helium gas was determined from pressure-volume diagrams to 
be Z? = 0.0237 litre/mole. The covolume of mercury is 0.01696 litre/mole. What is the 
size of the atoms in the two gases? 

2.5 a) Why are monochromatic x-rays used for the Debye-Scherrer method, and how 
are they produced? Does the diffraction cone with the smallest apex angle represent 
the smallest or the largest lattice plane spacing? How large is this spacing if a first-order 
angle a = 5° is measured between the surface of the cone and the undiffracted beam? 
(Assume that the quantum energy of the x-rays is 50 keV, E'x-ray = 50 ke V). 

b) The angle of maximum intensity of the first order reflection of x-rays with 
wavelength A = 2.1 A from the cleaved surface of a NaCl crystal is measured to be 
6 = 22° 10'. Calculate the lattice constant of the NaCl crystal. Use the result to derive 
Avogadro’s number. The density of NaCl is ^ = 2.163 gcm~^ 
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2.6 Monochromatic x-rays (A = 0.5 A) fall on a sample of KCl powder. A flat photo- 
graphic plate is set up 1 m from the powder and perpendicular to the beam. Calculate 
the radii of the sections of the Bragg diffraction cone (Fig. 2.10) for first- and second- 
order diffraction, assuming a lattice-plane spacing of 3.14 A. 

2.7 A tight bunch of slow neutrons (2 eV), which is produced in a nuclear reactor, 
lands on a crystal with a lattice spacing of 1 .60 A. Determine the Bragg angle for first- 
order diffraction. 

Hint: Use (7.1) for the wavelength of the neutrons. 

2.8 Atoms of an unknown element are deposited onto a perfectly planar crystal sur- 
face and are to be studied using a scanning tunneling microscope. The microscope 
employs a tungsten needle tip. What is the minimum size of the unknown atoms {r^), 
and how far apart must they be spaced, in order that they can be distinguished (take 
e.g. the criterion that the minimum measureable height Ah = 30% of the W atomic 
diameter)? 

Hint: Use a geometric sphere model, in which the radius of the W atom at the tip is 
0.16 nm. 




Atomic volume 



3. Isotopes 



3.1 The Periodic System of the Elements 



One of the early significant achievements of atomic physics in the past century - or 
rather, of chemists working together with the physicists - was the explanation of the 
periodic system of the chemical elements on the basis of atomic structure. 

This system (Table 3.1) was first proposed in 1869 by Medeleev and independently by 
Lothar Meyer, It is constructed by listing the atoms according to increasing nuclear charge 
number (or atomic number), Z. In the process, the chemical properties of the atoms are 
taken into account, so that chemically similar atoms are placed under each other in col- 
umns. With this procedure, we find eight vertical columns with subgroups and seven 
horizontal rows or periods. Each position is occupied by an atom which belongs there 
because of its chemical properties. To be sure, in this system all fourteen rare earths would 
have to be placed in the same position, i.e., at Z = 57, and all the actinides in position 
Z = 89. Both the periodicity and the above mentioned discrepancies will be explained on 
the basis of the electronic structure of the atoms near the end of the book in Chap. 19. 

Using heavy-ion accelerators, particularly the one operated by the GSI in Darm- 
stadt, Germany, it has been possible to discover additional transuranic elements since 
1980. By the end of 1995, elements up to atomic number 111 had been artificially pro- 
duced. These elements have unstable nuclei and many are so short-lived that their prop- 
erties are for the most part still unknown; even the process of naming them is as yet 
unfinished. 

The periodic system is an ordering of the elements according to periodically recur- 
ring chemical as well as physical properties. As an example of the latter we show here 
the atomic volumes and the ionisation energies as functions of the nuclear charge Z 
(Fig. 3.1). Chemical properties which periodically repeat themselves are, for example, 
the monovalence of the alkali atoms or the lack of reactivity of the rare gases. These 
empirical regularities indicate corresponding regularities in the atomic structure. 




large ionisation energies of the noble gas atoms 




3.2 Mass Spectroscopy 
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A first attempt at an explanation was the hypothesis of Prout (1815): all atoms are 
made up of hydrogen atoms. This picture was refined and modified as further ele- 
mentary particles were discovered, first the electron, then the proton. Only after 1932 
did it become clear that the atomic nucleus consists of neutrons as well as protons. The 
number of electrons of an atom is smaller than the mass number, since the nucleus con- 
tains just as many protons as the electronic shells have electrons, but it also contains 
neutrons. 

The relative atomic masses could originally only be measured by chemical 
methods. By these means, it was determined that the addition of hydrogen atoms alone 
cannot explain the observed “atomic weights” without contradictions. If the model of 
Prout is correct, then the atomic weights must be integers. For the most part, they are 
integers to a good approximation; A and A^^x ^re nearly equal. However, there are 
counter examples: the relative atomic mass - the atomic weight - of chlorine, for 
example, is y4rei=35.5 in the naturally occurring element. Furthermore, it was 
determined that lead from various ores had differing atomic weights. Today we know 
that this is due to the lead having been produced as the end product from different 
radioactive decay chains. 

These observations were the starting point for investigations which led to the dis- 
covery of the isotopes. This term denotes the fact that atoms with differing mass num- 
bers may belong to the same position in the periodic table, i.e., they may have the same 
nuclear charge number Z. The position of an element in the Periodic Table is determin- 
ed by the number of protons in its atomic nucleus. The differing mass numbers result 
from the different numbers of neutrons in the atomic nuclei. The concept of isotopes 
will be treated in the following sections. The existence of isotopes was discovered and 
thoroughly investigated with the aid of mass spectroscopy. 



3.2 Mass Spectroscopy 

3.2.1 Parabola Method 

The physical techniques for exact measurement of atomic masses and for separating 
atoms with differing masses are mostly methods for determining the ratio e/m, i.e., the 
ratio of charge to mass. For this purpose one uses the deflection of ionized atoms mov- 
ing through electric fields E and magnetic fields B. 

The oldest and most easily understood method is the parabola method of Thomson 
(1913). An ion beam from a gas discharge passes through the electric field of a conden- 
ser and a magnetic field B which is oriented parallel to the electric field (Fig. 3.2). In the 
plane of observation, particles of the same charge and mass, but having different 
velocities v, are distributed along a parabola whose origin is in the point where the 
undeflected beam would pass. This can be shown in the following manner: The homo- 
geneous electric field E, which is applied in the y direction, causes a deflection in this 
direction. The y coordinate of the particles changes according to the equation for the 
acceleration: 



y = {e/m) • E . 



(3.1) 



The y coordinate itself is given by the solution of (3.1), 
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Fig. 3.2. Schematic representation of the parabola 
method. The ion beam, collimated by the slit S, is 
deflected by the magnet M and the condenser C in 
the X and y directions. Equation (3.5) describes the 
path of the particles on a catcher screen immediately 
after exiting from the magnet and the condenser. If 
the screen is placed at a greater distance, a corre- 
sponding distortion of the parabolas due to projec- 
tion is seen. Both branches of the parabola are valid 
solutions if the B field is allowed to change its sign 



(3.2) 



where the last equation is found by expressing the time spent by the particles in the 
electric field in terms of the velocity v and the length / of the condenser. This is per- 
mitted as long as the magnetic field B is sufficiently weak and thus the radius r is 
sufficiently large. Since the deflection of the particles in the y direction is inversely 
proportional to the kinetic energy mv^l2, the condenser is referred to as an energy 
filter. 

The homogeneous B field, which is also applied in the y direction, produces a de- 
flection in the x direction. This deflection can be calculated as follows: 

The particles which enter the homogeneous B field are forced to follow circular 
orbits in a plane perpendicular to the direction of the field (y direction). Since, how- 
ever, the B field is limited in spatial extent (Fig. 3.2), the particles pass through only a 
segment of this circular orbit and then move on in a straight line. The resulting deflec- 
tion in the x direction may be derived by means of the radius of curvature of the cir- 
cular orbit, which is obtained by setting equal the magnitudes of the Lorentz force in 
the magnetic field, F = e(v xB), and of the centrifugal force = mv^r/r^: 



r = mv/eB . 



(3.3) 



For the centrifugal acceleration a^ = v^/r we obtain [by inserting (3.3) for the 
radius] the following relation: 

Gc = eBv/m . 



Since the particle only moves through a relatively short segment of the circle, we may 
replace its acceleration in the x direction with the centrifugal acceleration Gc- The total 
deflection in the time t is given by 

JC = Gj'^/l . 



In this equation, we replace g^ by eBv/m and the time of flight t by the quotient l/v, 
where / is the distance traveled in the field. We then obtain for the deflection in the x 
direction 

eBl^ 

2mv 



X = 



(3.4) 
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12 13 14 15 16 

C+ CH+ CH,+ CH3+CH4+ 



Fig. 3.3. Separation of a mixture of hydrocarbon 
ions with the Thomson parabola method. For cali- 
bration, one uses ions of known mass. The inten- 
sities of the individual parabolic sections correspond 
to the relative amounts within the mixture of the 
ions which produced them. [Photo after Conrad 
from W. Finkelnburg: Einfuhrung in die Atomphy- 
sik, 11, 12th ed. (Springer, Berlin, Heidelberg, New 
York 1976) Fig. 12] 



The X deflection is inversely proportional to the momentum mv of the particles. For 
this reason, one often calls the magnet causing the deflection a momentum filter. From 
the expressions for x and y we can eliminate p, so that we obtain the equation for the 
orbit of deflection of the particles: 



IE 




(3.5) 



This is the equation of a parabola = Ipy with the parameter p = tl^B^/AmE. This 
parameter has the same value for ions with the same ratio m/e but with differing veloc- 
ities V. An example of a measurement is shown in Fig. 3.3. 

The total intensity of the partial beam which produces a particular parabola is a 
measure of the relative abundance of the corresponding ion or isotope. Since the ions in 
general have differing velocities due to their preparation in an oven or a gas discharge 
tube, those ions having the same values of m/e will be distributed over the entire length 
of a particular segment of a parabola. 

Aston used this method in 1920 to investigate the composition of naturally occur- 
ring neon, which consists of 3 types of atoms with the mass numbers 20, 21, and 22; 
this was the first exact demonstration of the existence of isotopes by means of mass 
spectroscopy (Table 3.2). 

In any case, the most important result of the measurements with the parabola meth- 
od was the following: many elements consist of several isotopes, that is atoms with the 
same nuclear charge number Z and differing mass numbers A. Nuclei with particular 
values of A and Z are referred to as nuclides. 
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Table 3.2. Isotopic composition of neon. The values of given were not determined with the parabola 
method, but rather, with the precision quoted, by the use of a double-focussing mass spectrometer 



jNe 


90.920/0 


^rel = 19.99244 


)Ne 


0.26<^^o 


^rei = 20.99385 


jNe 


8.820/0 


= 21 .99138 



3.2.2 Improved Mass Spectrometers 

The first essential improvement of Thomson's mass spectrograph was achieved in 1919 
by Aston, namely the introduction of velocity focussing. He did not use parallel electric 
and magnetic fields as in the parabola method, but rather perpendicular fields E and B. 
The E field splits up the incident particle beam according to m/e, but also according to 
different velocities. By proper adjustment of the field strengths, one may however 
ensure that the B field brings all the particles with differing velocities together at a par- 
ticular point in space, while particle beams with different m/e ratios remain separated. 
Particles with the same m/e ratio are collected at one point by the detector and not 
along a parabolic segment as in the parabola method (Fig. 3.4). 

An apparatus with velocity focussing thus has a higher transmission for the ions 
than one which uses the simple parabola method, i.e., one can detect smaller amounts 
of ions and so, by closing down the slits, obtain a better mass resolution. The resolu- 
tion attained by Aston (1919) was about 130 for the ratio m/Am, that is, for the mass 
divided by the separable mass difference Am. 

The second major improvement was the introduction of directional focussing (first 
done by Dempster in 1918). By means of properly dimensioned sector fields (Fig. 3.5), 
it can be ensured that ions with the same m/e ratio but with somewhat differing angles 
of incidence, which are therefore deflected by differing amounts, are again collected at 
a point. 

In modern high-resolution mass spectrographs, both methods - velocity and direc- 
tional focussing - are used, leading to what is called double focussing. The precision 
attainable today for the relative atomic masses ylrei is down to 10“^ u. The same criteria 
apply as for optical spectrographs: by using narrow slits one obtains high resolution. 




Magnetic 

field 




Fig. 3.4. A focussing mass spectrograph as designed by Aston. The 
points 1, 2, and 3 denote the points at which three types of particles 
with three different values of m/e are collected 



Fig. 3.5. Directional focussing in a magnetic sector field, schema- 
tically illustrated. Particles which pass a longer distance through 
the region of magnetic field are more strongly deflected 
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''^ODH ’®0DH2 



^°He ^Sqd'^OH: 



" r. 



Fig. 3.6. An example of high-resolution mass 
spectroscopy: separation of 10 different ions 
with mass number 20, whose atomic or molec- 
ular weights lie between 19.9878 and 20.0628. 
The picture was made with the double-focus- 
sing mass spectrometer of Mattauch and co- 
workers [from W. Finkelnburg: Einfiihrung in 
die Atomphysik, 11, 12th ed. (Springer, Ber- 
lin, Heidelberg, New York 1976) Fig. 15] 



but at the cost of intensity. This represents the principal problem for the experimen- 
talist. The high resolution is mainly needed for nuclear physics problems, e.g., for the 
measurement of the so-called mass defect, but also for problems in analysis and struc- 
ture determination in chemistry. Sect. 3.2.4. The resolution m/Am which can be at- 
tained at present, i.e., the possibility of separating two masses with the values m and 
m-\- Am, is more than 100000. An example is shown in Fig. 3.6. 



3.2.3 Results of Mass Spectrometry 

In atomic physics, mass spectrometers are primarily of interest as instruments for 
analysing the isotopic composition of chemical elements. 

An element often has several isotopes, for example chlorine: an isotope with mass 
number 35 occurs with an abundance of 75.4<^^o; the other stable isotope with mass 
number A = 3>1 has an abundance of 24.6%. The resulting relative atomic mass of the 
isotope mixture is ^rei = 35.457. There are elements with only one stable isotope, for 
example 

27 a 1 127 T 

4 t 5 e, 13AI, 53I, 

and others with two stable isotopes, e.g., 

}H 

99.986% 0.014%, 

and finally there are elements with many stable isotopes. For example, mercury, soHg, 
has 7 stable isotopes with A between 196 and 204. A few further examples are con- 
tained in Table 3.3. 
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Table 3.3. Some examples of isotopes 

Mass number Rel. atomic weight Abs. atomic weight [10"^^ kg] 



'H 


1 


1.007825 


1.67342 




2 


2.014102 


3.34427 


12c 


12 


12.000000 


19.9251 


16 q 


16 


15.99415 


26.5584 


”C1 


35 


34.96851 


58.0628 


3’Cl 


37 


36.965898 


61.37929 



3.2.4 Modern Applications of the Mass Spectrometer 

Aside from precision measurements in atomic and nuclear physics, mass spectrometers 
with limited mass resolution are utilised today in many applications in science and tech- 
nology. 

In chemistry, simplified double-focussing spectrometers are used for analytical pur- 
poses. The molecular fragments which result from electron or ion bombardment of 
molecules are identified; from their distribution, a clue to the identification of the orig- 
inal molecules is obtained. 

In physics, chemistry, and technology, simple, compact spectrometers are used to 
analyse residual gases in vacuum systems. For this purpose, a mass resolution of 
m/Am = 100 is usually sufficient. 

A further application of these relatively simple spectrometers is the production of 
pure atomic and molecular beams. Recently, high-frequency mass spectrometers have 
been applied for this purpose. In these so-called time-of-flight spectrometers, charged 
particles are differently accelerated by high-frequency electromagnetic fields depending 
on their specific charges, and pass through the spectrometer with different velocities. 
The different times of flight (through the spectrometer) are a measure of the ratio 
e/m. 

In a quadrupole mass filter, the superposition of direct and alternating potentials 
on the four cross-connected, parabolically shaped electrodes results in an inhomoge- 
neous high-frequency field in the interior of the electrode assembly. A static field is 
superposed on the high-frequency field. Only particles with a particular mass and 
energy can pass through a filter with a given geometry and frequency (Fig. 3.7). 



(U+Vcos ®t) 





Fig. 3.7. Schematic of a quadrupole mass filter. The ion 
beam, moving in the +z direction, is deflected by a high- 
frequency alternating potential. In order for the beam to 
pass through the filter, a certain relation between e/m, the 
frequency cu, and the deflection voltages U and V must be 
fulfilled. The dashed orbit applies to an ion for which this 
condition is not met 



Ion source 
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3.2.5 Isotope Separation 

The separation of isotopes is more a problem of technology and nuclear physics than of 
atomic physics, which is the main topic of this book. For this reason, we will only brief- 
ly treat the subject here. 

In principle, any method which can distinguish particles on the basis of a physical 
property depending on the mass may be used to separate isotopes. Which one is em- 
ployed in a particular application depends on questions of economics and the state of 
the technology. The requirements are rather varied. 

Separation of the two hydrogen isotopes and jH with a mass difference of 100% 
is relatively easy, while by contrast the separation of and is considerably more 
difficult. In the latter case, the masses differ by only 1.25%. In the following, the most 
important methods will be briefly described. 

Electromagnetic separation with mass spectrographs is usually expensive and slow. 
The yields which can be obtained are of the order of 1 mg per hour at a current of 
10 A. For example, 35 g Cl as singly charged ions corresponds to 96500 As transport- 
ed charge. At a current of 10 A, 35 g of Cl will be deposited in a time 



9.65 *10^ As 
10“^A 



= 9.65 • 10^ s = 30 years . 



Nevertheless, this technique is applied on a large scale for isotope separation, for 
example for separating uranium isotopes, initially for the manufacture of uranium fis- 
sion bombs. The necessary investment of technology and energy is enormous. 

Mass separation by means of diffusion through porous membranes is based on the 
fact that in a gas, particles of differing masses and mi have different velocities vx 
and Vi at a given temperature. The following relation holds: 

vx/vi = ]/mi/mx , since mxv\=miv\, 

that is, the mean kinetic energy for both types of particle is the same. Light atoms are 
therefore on the average faster and diffuse more quickly. To obtain efficient isotope 
separation, many diffusion layers must be connected in series. This method was at first 
the most important technology for uranium separation: the gaseous compound UF^ is 
employed to enrich the uranium isotope relative to 

The gas centrifuge is also applied on a large scale for uranium separation. Here, the 
heavier isotope is acted upon by a stronger centrifugal force. The lighter isotope is en- 
riched in the region of the centrifuge axle. For effective separation, many stages must 
be employed one after another. The most serious technical problem is the strength of 
the materials used in view of the extreme accelerations necessary. 

The separation tube utilises thermodiffusion: it is based on the principle that a tem- 
perature gradient in a mixture of gases leads to a separation of the mixture; the effect is 
increased by convection. Along the axis of a long tube, a heater wire is suspended. The 
lighter isotope is enriched by thermal diffusion in the middle and at the top, the heavier 
isotope collects by convection at the outer wall and at the bottom of the tube. 

Fractional distillation in repeated steps uses the fact that the heavier isotope in 
general has the higher boiling point. For example, the boiling point of heavy water 
(D 2 O) lies 1.42 degrees above that of H 2 O. 
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In electrolysis, molecules with the heavier isotope are less easily decomposed than 
those with the lighter isotope. This technique is used for large-scale separation of heavy 
and light hydrogen. 

There are also chemical reactions in which molecules with differing isotopic compo- 
sitions react with different rates. In such cases, isotope separation can be achieved 
through chemical reaction. Since the availability of narrow-band, tunable light sources 
in the form of dye lasers, (see Chap. 21), laser photochemistry can also be used for iso- 
tope separation. In this method, certain isotopes in a mixture of molecules composed 
of various isotopes can be selectively photoexcited, leading to photochemical reactions 
of the selected molecules. Some interesting new techniques for isotope separation have 
been developed in recent years based on this principle. 



Problems 

3.1 Show that a transverse homogeneous magnetic field can be used to sort charged 
particles according to their momenta, and to sort monoenergetic particles according to 
their masses. All the particles have the same charge. 

3.2 An ion beam containing 'H'^, + , and is accelerated through a voltage of 

1000 V and is directed perpendicular to the field lines of a 0.05 tesla magnetic field. 
How far apart are the component beams when they have travelled 5 cm through the 
homogeneous magnetic field and are measured at a distance of 25 cm from the begin- 
ning of the magnetic field? 

3.3 A beam of positive ions traverses for a distance / = 4 cm an electric field 
l^l = 5000 V/m and a parallel magnetic field |S| = 0.01 tesla. The ions travel perpen- 
dicular to the direction of the two fields (parabola method). They then cross a field-free 
region /' = 18 cm and land on a flat fluorescent screen. What are the parameters of the 
parabolas on the screen if the beam consists essentially of singly charged hydrogen ions 
and hydrogen molecules with a velocity corresponding to an accelerating voltage 
between 1000 V and 4000 V? What does the image on the screen look like if both 
positively and negatively charged ions are directed at it? 

3.4 The isotopic abundance of and in naturally occurring uranium is 0.72% 
and 99.28%, respectively. If the isotopes are separated by diffusion, the isotopic mix- 
ture after one separation step is 0.754% How many separations are needed to 
enrich the to 50%? to 99%? 

Hint: The separation coefficient a = (relative concentration before separation)/(relative 
concentration after separation) is independent of the composition of the isotopic mixture. 
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4.1 Passage of Electrons Through Matter 

Beginning in the 1890s, Lenard investigated the attenuation of electron beams passing 
through matter. This attenuation can have two causes: the electrons can lose their 
energy by exciting or ionising atoms, or they can be elastically scattered and so change 
their directions and leave the beam. Lenard produced the beam by means of a cathode 
ray tube. Today, one would use thermionic emission as the electron source. 



Cathode Experimental 




Fig. 4.1. Schematic of Lenard' cath- 
ode ray tube. The cathode rays pass 
through a thin foil - the Lenard 
window - into the surrounding air 
and excite it, producing fluorescence 



An important result may be obtained from the qualitative experiment illustrated in 
Fig. 4.1: if the gas discharge tube used for producing the cathode rays is closed with an 
extremely thin aluminium foil, it may be observed that electrons from the cathode ray 
beam pass through the foil. They excite the air for a distance of several centimeters out- 
side the tube, yielding a bluish-red fluorescence light, and can be detected several cm 
away from the end of the tube by using a fluorescent screen. An aluminium window of 
this type, with a thickness of ca. 5 • 10“"^ cm, is called a Lenard window. The experi- 
ment offers visible proof that the electrons can pass through some 10000 atomic layers 
as well as several cm of air at NTP. Under the assumption that atoms were im- 
penetrable for electrons, the scattering of electrons by air would take place over a 
length of the order of the gas kinetic mean free path, i.e., in a range of about 10~^ cm. 

From such qualitative experiments, it follows that the interaction cross section for 
collisions of an electron from a gas discharge tube with atoms is small compared to the 
cross section for collisions between two atoms. 

For the quantitative determination of the interaction cross section between elec- 
trons and atoms, one may employ a setup analogous to that shown in Fig. 4.1, where, 
however, the cathode ray beam passes through the Lenard window into a scattering 
chamber. In the chamber, the electron current is measured after the beam has passed 
through a gas atmosphere of known composition and density. The collisions of the 
electrons with the atoms in the foil can also be investigated; for this purpose, the ex- 
perimental parameters (foil thickness, foil material, pressure and composition of the 
gas, and distance between foil and electron detector) may all be varied. The interaction 
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cross section is obtained from the ratio of incident (Iq) and transmitted (/) electron in- 
tensities by means of the equation derived above (2.22): 

/(x) = /oe-^ (4.1) 

where x indicates the thickness of the scattering layer. 

It may be shown that: 

- The absorption or scattering coefficient a is proportional to the pressure in the scat- 
tering chamber. This is in agreement with the definition of the total interaction cross 

n 

section given earlier as being equal to the sum of the partial cross sections, a=Yj 

/= 1 

since, for identical scattering particles, a= an\s> then the sum of all the interaction 
cross sections per unit volume, where n gives the number of particles per unit vol- 
ume and is proportional to the gas pressure. 

- In foils and in gases, independent of the phase of matter and of the particular prop- 
erties of the material, for a given electron velocity it is found that a/g = const, i.e., 
the interaction cross section is proportional to the density q of the scattering mate- 
rial. 

- With increasing electron velocity, the ratio a/g decreases strongly (Fig. 4.2). 



Fig. 4.2. Qualitative behaviour of the interaction 
cross section for gas atoms and electrons as a func- 
tion of the electron velocity. The collision radius R, 
which is connected with the cross section a by means 
of nR^ = G (Sect. 2.4.2), is plotted against the ratio 
v/c of the electron velocity to the velocity of light 




In Fig. 4.2, the collision radii calculated from the measured interaction cross sec- 
tions a are plotted as a function of the electron velocity. For very fast electrons, atoms 
are thus a factor of 10^ more penetrable than for slow electrons. The experiments lead 
to the following conclusion: only a small portion of the atomic volume is impenetrable 
for fast electrons, or, as expressed by Lenard - the inside of an atom is as empty as in- 
terplanetary space. 

At first, the only general conclusion which could be drawn from this result was that 
the mass and charge in an atom are distributed in a “lumpy” fashion, rather than being 
evenly spread throughout the atomic volume. Lenard spoke of nuclei and force fields. 
The analogy with a solar system was tempting. Today, we know that slow electrons are 
scattered by the atomic electron cloud, while fast electrons are scattered by the nucleus 
only. 

The realisation that there is one small nucleus, which contains the entire positive 
charge and almost the entire mass of the atom, is due to the investigations of Ruther- 
ford, who utilised the scattering of alpha particles by matter. 




4.2 Passage of Alpha Particles Through Matter (Rutherford Scattering) 
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4.2 Passage of Alpha Particles Through Matter (Rutherford Scattering) 

4.2.1 Some Properties of Alpha Particles 

Alpha particles are emitted by some radioactive nuclei. They consist of doubly ionised 
helium nuclei, 2 He^“^, with high kinetic energies (several MeV). They can, for example, 
be detected by means of their ability to ionise air in a cloud chamber; alpha particles 
with an energy of 5 MeV have a range of about 3.5 cm in air at NTP. In this distance, 
they lose their initial kinetic energy to the air molecules through ionisation and excita- 
tion processes. Since the mean free path for atoms or molecules as calculated by the 
kinetic theory of gases amounts to about 10“^ cm, we see that alpha particles can 
penetrate and pass through thousands of atoms (3.5 cm/10 3.5 • 10^ atoms) with- 
out being noticeably deflected from a straight path. Cloud chamber pictures show that 
the paths of the alpha particles are for the most part straight; only near the ends of the 
tracks, when the particles have lost most of their kinetic energy and are moving slowly, 
do we observe large deflections from straight-line paths (Fig. 4.3). Another possibility 
for observing the paths of alpha particles is offered by the spinthariscope or the 
scintillation detector (Figs. 2.16 and 2.17). Using scintillation detectors, Geiger and 
Marsden investigated the scattering of alpha particles in matter, which we will now 
treat in detail. 

I Fig. 4.3. Cloud chamber photograph of the track of an alpha particle, by Wilson. 

The particle passes through several cm of air without noticeable deflection. At the 
end of the track, we see two deflections; at the second, we can also see the short 
track of the target nucleus, which was accelerated to the right by the collision. 
[From W. Finkelnburg: Einfuhrung in die Atomphysik, 11,12th ed. (Springer, 
Berlin, Heidelberg, New York 1976) Fig. 3] 



4.2.2 Scattering of Alpha Particles by a Foil 

In order to investigate the interaction cross section for collisions between alpha par- 
ticles and atoms quantitatively, Rutherford and coworkers utilised the following ex- 
perimental setup (Fig. 4.4): 
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The alpha particles, which are emitted by naturally radioactive material R, pass 
through a collimator and strike a thin metal foil F. The transmitted alpha intensity is 
determined by means of a scintillation screen S, observed through the lens L. In con- 
trast to the determinations of interaction cross sections described above, in 
Rutherford’s experiments the directly transmitted alpha intensity was not the main ob- 
ject of the investigation; instead, the dependence of the scattered intensity on scattering 
angle 6 was determined, d is the angle between the directions of the deflected and the 
incident particle beams (Fig. 4.7). Scattering experiments of this type have become one 
of the most important tools in nuclear physics. A typical experiment yields a result like 
the one shown in Fig. 4.5. 

The scattered intensity decreases strongly with increasing scattering angle. The 
angular dependence is well described by the inverse fourth power of the sine of half the 
scattering angle. At large scattering angles, deviations from this dependence are seen; 
we will treat this so-called anomalous Rutherford scattering in Sect. 4.2.4. 

It is further observed that scattering occurs even at very large angles. It can be con- 
cluded that this is not due to multiple scattering processes; in scattering of alpha par- 
ticles by helium atoms in a cloud chamber, large deflection angles, namely 90°, can be 
seen directly. An example is shown in the cloud chamber photograph in Fig. 4.6. 

A quantitative explanation of these results may be given with the help of the 
Rutherford atomic model (1911). The model states that; 





Fig. 4.4. Experimental setup for the investigation of 
Rutherford scattering: alpha particles from the 
radioactive source R are scattered by the foil F. The 
light flashes which are produced by the scintillation 
screen S are observed through the observation tele- 
scope L. The chamber can be evacuated and the ob- 
servation lens L may be turned around the foil axis 
by means of the ground-glass joint J 
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Fig. 4.5. Graphical representation of the experimen- 
tal results of Geiger and Marsden for the Rutherford 
scattering of alpha particles by a gold foil. The scat- 
tering rate is plotted as a function of the scattering 
angle 6. The solid curve represents the theoretical 
function for Coulomb scattering 
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Fig. 4.6. Cloud chamber photographs of 
alpha particles. Collision processes with 
the gas in the chamber can be seen; left, 
the chamber gas is hydrogen, right, it is 
helium. In hydrogen, the alpha particle is 
only slightly deflected from a straight- 
line track, while the hydrogen target nu- 
cleus recoils sharply off to the left. In he- 
lium, the angle between the tracks of the 
alpha particle and the recoiling nucleus 
after the collision is 90°, since the two 
particles have the same mass. [From K. 
H. Hellwege: Einfuhrung in die Physik 
der A tome, Heidelberger Taschenbii- 
cher, Vol. 2, 4th ed. (Springer, Berlin, 
Heidelberg, New York 1974) Fig. 4] 



- Atoms have nuclei with a radius R of about 10“^^ cm. The nucleus contains nearly 
the entire mass of the atom. A collision between an alpha particle and a much lighter 
atomic electron produces no measurable deflection in the alpha particle’s path. 

- The atomic nucleus has a positive charge Ze, where Z is the position of the element 
in the periodic table. 

- Around the positively charged nucleus is a Coulomb field given by (at distance r) 

E = (i/4nso) — E. (4.2) 

r r 

4.2.3 Derivation of the Rutherford Scattering Formula 

The above model leads to the Rutherford scattering formula (4.20) if we take into ac- 
count only the Coulomb repulsion between the nuclear charge and the charge of the 
alpha particle. We will use the model to calculate the dependence of the scattering 
probability on the deflection angle in two steps: first, for a single scattering event we 
determine the dependence of the deflection angle on the impact parameter b, which is 
the distance of closest approach of the alpha particle to the target nucleus, assuming no 
deflection occurs (see Fig. 4.7). We shall see that a unique relation between the impact 
parameter b and the deflection angle 6 exists. Secondly, we will average over all pos- 
sible impact parameters, since we cannot follow a single alpha particle on its path 
through the target foil, but rather observe the scattering of many alpha particles. 
Multiple scattering will not be considered; for the experiment, this means that the 
target foil must be sufficiently thin that each alpha particle is only scattered once on 
passing through the target. 

In order to calculate the path of the particle we recall the motion of a planet under 
the influence of an attractive gravitational field. The effective force is proportional to 
1/r^ where r is the sun-planet distance. The orbits which one finds in this case are 
known to be either elliptical, parabolic, of hyperbolic. 

Since the Coulomb force has the same dependence upon distance r as the gravita- 
tional force, the orbital calculations from celestial mechanics can be utilised directly. 
Admittedly, since the Coulomb force is here repulsive, only the hyperbolic orbits 
represent possible solutions when we are dealing with charges of the same signs. 
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Fig. 4.7. Rutherford scattering. Upper part: 
Schematic illustration of the experimental setup. 
Lower part: The geometry of the model calcula- 
tion. The alpha particle is deflected from A to B 
through scattering by the nucleus Ze 



We now wish to determine the relation between the scattering angle 6 and the 
impact parameter b (Fig. 4.7). The particle arrives at point A, still distant from the 
nucleus, with a velocity Vq, If it were not deflected, it would pass the nucleus at a dis- 
tance b. Between the alpha particle and the nucleus, there is a repulsive Coulomb force 
F 



IZe^ 1 
47r£o'’^ ^ 



(4.2a) 



with the nuclear charge Ze, the elementary charge e, the permittivity constant Sq, and a 
distance r between the nucleus and the alpha particle. 

We assume that the particle has reached point M in its orbit and express the force 
which acts there in terms of two components: 



F ^ = Fsin 0 perpendicular to the original direction , (4.3) 

and 

\F| I = Fcos 0 antiparallel to the original direction . (4.4) 

0 is the angle between the horizontal (i.e., the direction of the incident beam) and the 
radius vector r to the momentary position of the particle. 

We now apply the law of conservation of angular momentum, placing the origin of 
the coordinate axes at the centre of the atomic nucleus. Since the force which acts here 
is radial (4.2a), it produces no torque and the angular momentum is constant; in partic- 
ular, the angular momenta at the points A and M are the same, or, mathematically. 
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(mroZ7)A = (mr^0)M (4.5) 

in which we have used polar coordinates (r, 0). Solving for \/r^ yields 

1/r^ = 0/vob . (4.6) 



If we consider only the motion perpendicular to the original beam direction, New- 
ton’s equation of motion reads 



m 



dv^ „ 2Ze^ 1 . ^ 

= F, = s-sm(/». 



dt 



^ AnEn 



(4.7) 



If we replace 1/r^ in this equation with the right-hand side of (4.6) and integrate over 
time, using the abbreviation k = IZe^/AuEQ, we obtain 



dt mvob A dt 



(4.8) 



In order to determine the limits of the integral, we imagine the point A to be infinitely 
distant from the nucleus. Since now no Coulomb force acts, we have = 0, and the 
angle 0 = 0. 

To determine the scattering angle 6 between the incident direction and the direction 
of the particle after scattering, we let point B (see Fig. 4.7) move away to infinity. Then 
the angle 0 is seen to be related to 6 through the expression 0 = 180° - 9. Because of 
conservation of energy, the final velocity at the point B is equal to the initial velocity Vq 
at point A, since the potential energy vanishes at a sufficiently large distance from the 
nucleus. The component has, using 0 = 180° - 6, the value v^_ = VQsinO. Then the in- 
tegral equation (4.8) becomes, using 

dt = dv^ and ^^dt = d0, 
dt dt 



the following equation: 

uosin^ n-6 

J dv^ = J sin0(i0. (4.9) 

0 mv^b 0 

Upon integration, we obtain 

Posin 9 = — - — (1 + cos 9 ) . (4.10) 

mvob 

With the trigonometric identity 

ll^^ = cot(e/2) (4.11) 

sin0 

we obtain the relation between the impact parameter and the deflection angle which we 
are seeking: 
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6 = — ^cot(6)/2) . (4.12) 

mvQ 

In an actual experiment, we cannot measure the number of scattered particles arriv- 
ing at a particular angle 6, but rather we have to consider the finite range of angles be- 
tween 9 and 9+d6; these correspond to impact parameters in the range b to b + db. 
Then, by differentiating (4.12), we obtain the relation between db and dO: 

db=-—!^ — ^ dd. (4.13) 

2m Vq sim(0/2) 




Fig. 4.8. Rutherfor(i scattering. The incident 
alpha particles with impact parameters in the 
range biob + db are deflected into the range 
of angles 0 io 9- \ d0\ 



Finally, we have to consider that the whole problem has rotational symmetry 
around an axis through the target nucleus and parallel to the direction of the incident 
beam (Fig. 4.8). Therefore, we have to consider a circular ring with radii r^ = b and 
a *2 = b + db, through which the incident beam enters and is scattered into the angular 
region from 6- \ d0 \io 9. [We note that with increasing impact parameter b the angle 9 
becomes smaller, see (4.12)]. This range of angles corresponds to an “effective area”, 
the differential cross section da: 

da = 2nbdb . (4.14) 

If we shoot alpha particles through a thin foil with thickness D and area contain- 
ing atoms/cm^, the “effective area” of all the atoms is 



dA = 2nbdbNDA (4.15) 

with the condition that the “effective areas” of the atoms do not overlap one another, 
which is a good assumption in a thin foil (up to 10000 atomic layers). 

The probability that an incident alpha particle strikes the “effective area” of an 
atom in the foil is given by 



W = 



“effective area” 
total area 



= dA/A = 2nNDbdb. 



(4.16) 



With a total of n alpha particles, the number dn' of the particles which strike the 
“effective area” and thus are deflected into the angle range 9 to 9- \ d9\h given by 
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dn' = n • InN D b db . (4.17) 

These particles pass through the unit sphere around the target foil on a ring of area 
n^\n6\d0\. In the following, it is convenient to use the half-angle 6/2\ doing 
so, we obtain 

= 471 sin(l9/2) cos(l9/2) \d6\, (4.18) 

The detector which is used in the measurement cuts out a segment dQ from this ring- 
shaped area. This surface element on the unit sphere is called a solid angle. The number 
of particles actually measured is therefore smaller than the number dn' by the ratio 
dQ/dQ^^\ If the detector subtends a solid angle of dQ, the number of particles ob- 
served at angle 6 is given by 



dn = dn' • dQ/dQ^^\ 



(4.19) 



Inserting b and db from (4.12) and (4.13), we obtain the full Rutherford formula: 



dn{6,de) 

n 



Z^e^DN 

(4 ^ ^ 0 sin'^(0/2) 



dQ 



(4.20) 



with n the number of incident particles, dn the number of particles scattered at an angle 
6 into the solid angle dQ, Z the (target) nuclear charge, e the elementary charge, D the 
target foil thickness, the number of atoms/cm^ in the target foil, dQ the solid angle 
subtended by the alpha particle detector, Sq the permittivity constant of vacuum, m the 
mass of the scattered (alpha) particles, vq the velocity of the incident particles, and 6 the 
angle of deflection. 

This formula tells us how many particles dn out of the incident number n are scat- 
tered at a particular angle 6 into a particular solid angle dQ, when target properties and 
incident particle velocity are known. Corresponding to (4.20), we find for the differen- 
tial cross section (4.14) 



da = 






(AneoYm^ visin' {0/2) 






(4.21) 



Furthermore, it is useful to define the macroscopic (differential) cross section Nda, 
which is equivalent to the “effective area” dA per unit volume. By integration of (4.21) 
over Q^^\ we can obtain the total interaction cross section a\ the latter, however, di- 
verges in the present case of a pure (unscreened) Coulomb potential, since (4.21) di- 
verges for 0. In the Rutherford scattering formula (4.20) for scattering by a foil, the 
limiting case 0^0 is in principle not physically relevant: this is a result of the model, 
since 6=0 means that b = oo. An infinite value of the impact parameter is, however, 
unreasonable given the assumed dense packing of the target atoms; the largest possible 
impact parameter is equal to half the distance between target atoms in the foil. For 
6= n, dn/n shows a minimum. This corresponds to b = 0. For very small impact 
parameters, there are deviations between the results of the calculation using the scatter- 
ing formula (4.20) and the experiments. This occurs because the model of a deflection 
of the alpha particles by the Coulomb field of the nuclei alone is insufficient. From the 
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values of the impact parameter b for which these deviations become important, we can 
determine the nuclear radius R. This will be discussed in the following section. 

4.2.4 Experimental Results 

The Rutherford formula has been experimentally tested with great care. Keeping the 
solid angle constant, the sin"'^(0/2)-law is found to be excellently reproduced in the 
counting rate (Fig. 4.5). Even with alpha particles of energy 5 MeV and scattering 
angles of 150°, no deviations from the Rutherford formula are found; this corresponds 
to an impact parameter of 6 • 10“^^m. In this region, only the Coulomb potential of 
the nucleus has a measurable effect on the alpha particles. 

The experimental tests of the Rutherford scattering formula can be summarised as 
follows: 

The Coulomb law is obeyed well even at very small impact parameters, since the 
Rutherford formula is still valid. The nuclear radius is thus 

i?<6- 

From experiments with different foil materials, the nuclear charge Z can be deter- 
mined. The experiments of Chadwick (1920) verified that Z is identical with the posi- 
tion of the element in the periodic table. 

The nucleus was originally assumed to be constructed from A protons and (A - Z) 
electrons where A is the mass number defined on p. 6. After 1932 it was known that this 
model is not correct; {A -Z) is rather the number of neutrons and Z is the number of 
protons in the nucleus. 

We come now to the so-called anomalous Rutherford scattering. In the scattering of 
very fast alpha particles (£>6 MeV) at large angles 6, i.e., with small impact para- 
meters b - nearly central collisions - one observes clear deviations from the Ruther- 
ford formula. Here the Coulomb law is apparently no longer obeyed. The alpha par- 
ticles approach the nuclei so closely that another, short-range interaction force be- 
comes effective: the nuclear force. From the values b and 6 at which deviations from 
the Rutherford formula, i.e., from the Coulomb law begin to occur, a nuclear size of 
/? = 10"^^ m is obtained. This means that the density of the nucleus is about 10^^ times 
larger than the density of the atom as a whole. These deviations from the scattering be- 
haviour expected on the basis of the Rutherford formula are called anomalous 
Rutherford scattering. 

The Rutherford model may be developed further. Negative electrons orbit around 
the positively charged nucleus with nuclear charge Z. This represents a dynamic equi- 
librium: without the motion of the electrons, no stability would be possible. If deflec- 
tions of alpha particles through large angles are possible without causing a noticeable 
energy loss on the part of the alpha particles, then the mass of the target nucleus must 
be large compared to that of the alpha particle. On the other hand, observations with 
cloud chambers filled with helium gas, in which the target and the projectile, i.e., a He 
atom and an alpha particle, have virtually the same mass, show deflections of about 
90°. From such experiments one can show that the nucleus must contain nearly the 
whole mass of the atom. 

By contrast, momentum conservation requires that in a collision between an alpha 
particle and an electron, due to the small electron mass only very little momentum can 
be transferred. With the electron/alpha particle mass ratio, the deflection of the alpha 
particles can be no larger than 28". 
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For very large impact parameters (small deflection angles), the Rutherford formula 
is likewise no longer exactly valid. The Coulomb potential of the nucleus is perturbed 
by the atomic electrons. These effects occur for 6 ^ 10"^^ cm (deflection angles of a 
few seconds of arc) and are very difficult to detect experimentally. Completely 
analogous scattering formulae and scattering problems occur in the scattering of 
protons by atomic nuclei. The angular dependence of the scattering is related to the 
scattering potential; the latter can thus be determined from experiment. Scattering pro- 
cesses play an important role in nuclear and elementary particle physics, in the investi- 
gation of the internal structure of nuclei and of certain elementary particles. For 
example, Hofstadter\v2iS granted the Nobel Prize in 1961 for his scattering experiments 
using fast electrons (10^ eV) on protons and neutrons. From the angular dependence of 
the scattering intensity, he was able to obtain information about the inner structure of 
the proton and of the neutron. 

4.2.5 What is Meant by Nuclear Radius? 

We can summarise our considerations in the above sections as follows: an alpha par- 
ticle, which approaches a nucleus from outside the atom, is acted on at first only by the 
repulsive Coulomb potential. If it approaches the nucleus sufficiently closely, it will 
also be acted upon by the attractive nuclear force. The nuclear radius is defined as the 
distance at which the effect of the nuclear potential is comparable to that of the 
Coulomb potential (Fig. 4.9). For such investigations, alpha particles of high kinetic 
energies are used, so that they can approach the nucleus sufficiently closely. 




Fig. 4.9. Nuclear force and Coulomb potentials, 
used for defining the nuclear radius R 



The empirical result of such measurements on nuclei with the mass number A is 
found to be 

Numerical examples for >4 = 12 and A = 208 are: 
i?(’iC) = 2.7-10"’^m, /?(^g|Pb) = 7.1 
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This relationship between the nuclear mass and the nuclear radius implies that the 
density of nuclear matter is constant and independent of the size of the nucleus. This is 
one of the experimental results underlying the liquid-drop model for nuclei. 



Problems 

4.1 An aluminium foil scatters 10^ a particles per second in a given direction and solid 
angle. How many a particles will be scattered per second in the same direction and solid 
angle if the aluminium foil is replaced by a gold foil of the same thickness? 

4.2 The number of a particles scattered from a foil into a counter is 10^ s “ ^ for a scat- 
tering angle of 10°. Calculate from this the number of a particles which will be scat- 
tered into this counter as it is moved on a circular path from 10° to 180°. Show your 
results for N{8) graphically. 

4.3 Determine the distance of the closest approach of protons to gold nuclei in head- 
on collisions in which the protons have kinetic energies of (a) 10 MeV and (b) 80 MeV, 
and compare the results with the nuclear radius. In which case would the proton 
“touch” the nucleus? Determine the kinetic energy of the proton when it “touches” the 
nucleus. 

4.4 Through what angle is a 4 MeV a particle scattered when it approaches a gold 
nucleus with a collision parameter of 2.6 x 10“^^ m? 

4.5 How large is the collision parameter of an a particle with 4 MeV kinetic energy 
which is scattered through the angle 0 = 15° by collision with a gold nucleus (Z = 79)? 

4.6 A beam of a particles with 12.75 MeV kinetic energy is scattered off a thin 
aluminium foil (Z = 13). It is observed that the number of particles which are scattered 
in a certain direction begins to deviate from the value calculated for pure Coulomb 
scattering at the deflection angle 0= 54°. How large is the radius of the A1 nucleus if 
one assumes that the a particle has a radius i?cr=2xl0“^^m? 

Hint: Calculate the orbit according to (4.8 and 9) up to 0o = (180- 0)/2, the point of 
closest approach, and determine r(0o)* 

4.7 A tight bunch of protons with uniform energy strikes a 4 pm thick gold foil per- 
pendicular to the direction of flight. Of these protons, the fraction ^ = 1.35 xlO"^ is 
scattered through the angle 6 = 60° in the angular interval dO. 

a) What is the kinetic energy of the colliding protons? 

b) Calculate the differential effective cross section da{6)/dQ of the gold nucleus. 

c) What is the collision parameter bl 

Hint: Use (4.20) and the expression 
dn/n daje) 

dQ dQ 

4.8 Why are a particles used for scattering from the gold atoms in the Rutherford 
experiment, and not electrons? What advantages and/or disadvantages would neutrons 
have as projectiles? 
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5.1 Wave Character of Light 

The fact that light can be regarded as a wave phenomenon was experimentally shown in 
the 17th and 18th centuries by the Dutch physicist Huygens and the English physician 
Young with the aid of interference experiments. In the 19th century, the physical nature 
of these waves came to light: they are electromagnetic waves, described by Maxwell’s 
equations. They are characterised by the field vectors E and B of the electric and the 
magnetic field and exhibit a periodicity with the frequency oj. 

In the year 1885, the theory of electromagnetic phenomena was completed with the 
formulation of the Maxwell equations. Two years later (1887), Hertz succeeded in 
demonstrating that such waves can be produced in the laboratory as emissions from an 
oscillating dipole. According to Maxwell, an electric and a magnetic field propagate 
away from an accelerated charge with the velocity of light. The accelerated charge 
radiates energy. The emission of light in the oscillator model is a result of a high fre- 
quency oscillation carried out by a charged particle. In absorption and in scattering of 
light, the incident electromagnetic wave excites the oscillator to forced oscillations. 
This classical Maxwell theory permits the precise calculation of the electromagnetic 
waves which are emitted by radio and radar antennas. Furthermore, it completely 
describes all of the wave properties of electromagnetic radiation, for example inter- 
ference and diffraction. 

Electromagnetic waves may be produced over an extremely wide range of 
frequencies (see Fig. 8.1); for this purpose, a number of different processes may be 
used. 

Besides the oscillating dipole, some other examples are: 

- the emission of light by the electrically charged particles in particle accelerators. 
Here, the synchrotron radiation is particularly noteworthy. The circulating particles 
in a circular electron accelerator emit radiation with a continuous spectrum. This 
radiation is utilised - for example at the German Electron Synchrotron (DESY) in 
Hamburg - as an intense, continuous, polarised light source for spectroscopy in 
the near, mid-, and far ultraviolet spectral regions. Figure 5.1 shows a schematic 
illustration of the accelerator in Hamburg. In Fig. 5.2, the spectrum of the syn- 
chrotron radiation is indicated. At relativistic particle energies, i.e., when the 
particle velocity is no longer small compared to the velocity of light, the emitted syn- 
chrotron radiation energy is a considerable fraction of the total energy which must 
be expended to operate the accelerator. 

Originally, electron synchrotrons were constructed as particle accelerators for 
high-energy physics research, and the synchrotron radiation was only a byproduct. 
Today, special storage rings (“dedicated sources”) are built expressly for producing 
radiation, and yield intense beams of photons up to very high energies. An ex- 
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ample of such a large “third-generation” machine is the European Synchrotron 
Radiation Facility (ESRF) in Grenoble, France. 

- A radiation emission which is produced in a similar manner and which is also called 
synchrotron radiation occurs when charged particles become trapped in the 
magnetic field of the earth. This phenomenon also occurs in distant regions of 
space, for example in the famous Crab nebula. Various astronomical objects are 
known to emit radiation in frequency regions from the far ultraviolet down to radio 
frequencies. 

- A negative acceleration of electrons - for example a slowing down in the field of an 
atomic nucleus - leads to the emission of x-rays, the so-called bremsstrahlung. 

- The thermal radiation of the sun is the energy source for all life on the earth. 
While the wave character of light must be considered to be an experimentally and 

theoretically well-established fact, especially because of diffraction and interference 
phenomena, there are, on the other hand, experiments in which light behaves as 
particles; these are cdiWtd light quanta or photons. 

Before we describe the experiments which demonstrate the particle nature of light, 
we will summarise the most important physical properties of photons. 



Photon 

Energy h v 

Velocity c 

Rest mass m = 0 



Momentum p = hv/c = h/X 

Intrinsic angular 
momentum (spin) h/2 n 




Fig. 5.1. A schematic representation of the electron synchrotron DESY and the 
storage ring DORIS in Hamburg. Electrons are preaccelerated by linear accelera- 
tors; positrons are generated in a converter target and stored in the accumulator 
storage ring PIA. Then both kinds of particles are further accelerated in the syn- 
chrotron DESY II and injected into the DORIS ring, where they can be stored for 
several hours. Alternatively, the particles may be accelerated to still higher energies 
in the former storage ring PETRA, and then injected into the electron-proton 
storage ring HERA (not shown in the drawing), where they are used for experi- 
ments in high-energy physics. The DORIS storage ring is used both for high-energy 
physics and as a large-scale source of photons for experiments with synchrotron 
radiation. In the facilities of the Hamburg Synchrotron Radiation Laboratory 
(HASYLAB) and the European Molecular Biology Laboratory (EMBL), about 30 
experimental stations have been constructed for synchrotron radiation experi- 
ments. (Provided by C. Kunz, Hamburg University) 




Fig. 5.2. The spectral intensity distribution of the syn- 
chrotron radiation from various electron synchrotrons 
and storage rings (photons/s eV mrad“^). The radia- 
tion is continuous from the visible into the x-ray 
regions. In the far-ultraviolet and soft x-ray regions, 
an electron or positron storage ring is currently the 
best radiation source available. In addition to the 
spectrum emitted from the bending magnets of 
BESSY (in Berlin) and HASYLAB (in Hamburg), we 
show the considerably higher photon flux from the 32- 
pole “wiggler” W1 at HASYLAB. (Provided by 
C. Kunz, Hamburg University) 
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For converting from the quantum energy of a photon E = hv, which is often expressed 
in eV, to the vacuum wavelength Ayac of the light wave, the following relation holds: 

£[eV] = 12398/A [A]. (5.1) 

In the next sections we will describe three experiments which can only be under- 
stood by assuming the existence of photons. 
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5.2.1 Spectral Distribution of Black Body Radiation 

The quantisation of energy in the interaction of light with matter was postulated for the 
first time by Planck in the year 1900 in his theoretical analysis of the spectral distribu- 
tion of the light emitted by a black body radiator (defined below), which had been ex- 
perimentally determined. This light is referred to as thermal or black body radiation. 

Hot objects emit light radiation as a result of their temperature. This is an everyday 
experience. It is well known that the colour which we see in a thermal radiator (for 
example a furnace) changes from dark red to bright red to yellow to white as the tem- 
perature of the furnace is increased. The determination of the colour in the interior of a 
furnace is used as a measure of its temperature; this technical application is called 
pyrometry. 

At temperatures below a few hundred kelvins, the radiation emitted is for the most 
part infrared light, also called heat radiation. This infrared radiation is responsible for 
the fact that a thermally isolated object eventually reaches the same temperature as its 
surroundings. If one wishes to carry out experiments at very low temperatures (e.g., 4.2 K 
and below), the experimental region must therefore be screened from the thermal radia- 
tion of the laboratory, which is at room temperature, by using cooled thermal shields. 

The laws governing the spectral intensity distribution of thermal radiation are 
obtained by the experimental analysis of the black body radiator. This is a cavity which 
emits radiation being in thermal equilibrium with its walls; the material of the walls 
emits and absorbs thermal radiation. 

Experimentally, a source of black body radiation is most easily obtained by making 
a small hole in the wall of a cavity which is held at constant temperature. The hole is so 
small that neither the radiation which enters the cavity from outside, nor that which 
escapes to the outside, is sufficient to alter the thermal equilibrium in the cavity 
(Fig. 5.3). The energy density w(v, T) of the radiation field within the cavity can be 
determined by measuring the radiative power (energy per unit time) N{v, T) connected 
with the spectral radiative flux density P(v, T) which passes out of the hole, using a 
spectrometer. The spectral energy density u{v,T) is defined as 



w(v, T)dv 



radiation energy in the frequency range v. . . v + rfv 
volume 



and the spectral radiative flux density 2P(v, T) as 



2P{v,T)dv 



radiative power in the frequency range v. . . v + ^/v 
area • solid angle 




Intensity 
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Temperature T 



Fig. 5.3. Thermal radiation. Upper part: Schematic 
illustration of a cavity radiator. Lower part: Typical 
measured curves of the intensity distribution in the 
black body radiation at various temperatures 
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2P(v, T)dv '\s the quantity of energy within the frequency interval v. . . v + rfv which 
is radiated per unit time through a unit area into the solid angle 1 sterad normal to the 
surface. The factor 2 in the definition comes from the fact that the radiation can be de- 
composed into two components with polarisation directions perpendicular to each 
other. P(v, T)dv for one component, i.e., for linearly polarised radiation, is independ- 
ent of the polarisation direction for a black body radiator. For the energy radiated 
from a surface A at an angle 6 to the surface normal as unpolarized radiation into the 
solid angle Zl we have (in the time interval A t and the frequency range v . . . v + rfv): 

E = u(v, T)dvc- A t-(AQ/4n)A cose . (5.2) 

Figure 5.3 shows some typical experimental results. The radiation has a continuous 
spectrum with a prominent maximum, which lies in the infrared when the black body 
radiator is at room temperature. 

The following results are important: 

- At a given temperature, the energy distribution is the same, independently of the 
shape and material of the cavity. 

In (5.2), the radiation in a frequency interval from v to v + rfv at an angle 6 is 
calculated. The radiated power of the surface element per polarization direction in 
all directions into a hemisphere can be found by integration over angles: 

^ 2n n/2 ^ 

P(v, r)tot = — u(y,T) j j cos 6 sm 6 dO = -u{y,T) . (5.2a) 

Stt 0 0 8 

The limits of integration here imply that the surface can only radiate outwards (into 
a hemisphere, see Fig. 5.3). From this expression we obtain the total radiation at 
a temperature Thy integrating over all frequencies and polarization directions: 
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S = l]p{y,noidv = a-T* . (5.3) 

0 

This is the Stefan-Boltzmann Law, with a = 5.670 -10“^ Wm“^ 

- The Wien Displacement Law holds for the wavelength at which the maximum inten- 
sity Ajnax occurs in the emitted spectrum, as a function of temperature: 

Amax T = const = 0.29 cm K . (5 .4) 

As an example we can take the solar radiation: the surface temperature of the sun is 
6000 K; the wavelength at maximum intensity is A^ax = 480 nm. 

- The law derived by Rayleigh and Jeans from classical electrodynamics, 

P = ^kT (5.5) 

describes the radiative flux density per polarisation direction very well for low fre- 
quencies. However, at high frequencies, this law cannot be correct: if it is integrated 
over all frequencies, it yields an infinitely large energy density - we encounter the 
so-called ultraviolet catastrophe. Within classical electrodynamics and thermo- 
dynamics it was not possible to find an expression for P which agreed with experi- 
ment at high frequencies. This was accomplished for the first time by the Planck 
formula. 



5.2.2 Planck’s Radiation Formula 



According to Planck, the experimentally determined spectral energy density of the 
radiation per unit volume, in the frequency interval v. . . v-hc/v, can be represented by 



w(v, T)dv = 



Snhv^ 



1 



dv . 



(5.6) 



From this we obtain for the radiative flux density per polarisation direction and solid 
angle 



P(v, T)dv = 



hv^ 



dv . 



These radiation formulae may be derived by making the following assumptions: 

1) The atoms in the walls of the cavity behave like small electromagnetic oscillators, 
with each having its characteristic oscillation frequency v. They radiate electromag- 
netic radiation out into the cavity and absorb radiation from the cavity; a thermal 
equilibrium is established between the radiation in the cavity and the atoms in the 
cavity walls. The excitation of the oscillators depends on the temperature. 

2) The oscillators cannot - like classical oscillators - take on all possible values of 
energy, but rather only discrete values described by 



En = n • h • V , (5.7) 

where n is an integer /? = 0,1,2,... and h is Planck’s constant (/z = 6.626176 
• 10"^'^ Js = 4.14 • 10“^^eVs). Today we know that this quantisation is more cor- 
rectly described by the equation 
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Eri = {n + \/2)hv. 



(5.8) 



The quantity hv/2 is the zero-point energy of the oscillator. We will derive (5.8) 
using a quantum mechanical treatment in Chap. 9. 

3) As long as the oscillator is not emitting or absorbing (radiation) energy, it remains 
in its quantum state, characterised by the quantum number n. 

4) The number of possible states of oscillation of the electromagnetic field in the 
cavity of volume F between v and v + rfv for both polarisation directions is given by 



dZ = 



8 7tFv^ 



dv , 



(5.9) 



which can be derived in classical electrodynamics. 

The existence of discrete energy values represents a contradiction to the experience 
of classical physics, where energy always seems to occur with continuous values. The 
reason that quantised energy steps are not observed in classical physics is the smallness 
of the Planck constant h. We give a numerical example to make this clear: 

A mass-and-spring harmonic oscillator with a mass m = 1 kg and a spring constant 
f =20 Nm“^ is oscillating with an amplitude Xq = lO'^m. Its characteristic frequency 
is then given by 

v = (\/2n)Vf7m = Q.l\ s"*. 

The energy of the oscillator is 

£’=/x^/2 = 1.0-10“^J. 



This energy corresponds to n energy quanta of the frequency v: 



n = E/h V = 



10“^ J 

6.6-10“^^Js-0.7s“‘ 



= 2.1 



• 10 ^° . 



If n now changes to « ± 1 , this produces a relatively energy change of 
AE/E= 

n- hv 



The relative change is thus extremely small. 

From this we conclude that an energy quantisation for macroscopic systems in the 
realm of classical physics cannot be detected, due to the extremely large quantum 
numbers which occur. 



5.2.3 Einstein’s Derivation of Planck’s Formula 

The derivation of the Planck radiation formula due to Einstein (1917) is an interesting 
example of the combination of optics, thermodynamics, and statistics. Einstein 
assumed that light consists of particles, the so-called light quanta or photons. Each 
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light wave of frequency v corresponds to a number of photons. Furthermore, in this 
theory the existence of discrete atomic energy levels is already assumed. The justifica- 
tion for this latter assumption in terms of the Bohr atomic model will be treated in 
Chaps. 8 and 9; we thus anticipate it here. 




Absorption Spontaneous emission Stimulated emission 



Fig. 5.4. Absorption, sponta- 
neous and stimulated emission of 
radiation with the quantum ener- 
gy v between two energy levels 
and E2 



An atom with two energy levels £’ 1 , £2 rn^y» according to Einstein, interact with 
electromagnetic radiation in three different ways: 

- Absorption of a light quantum takes the atom from the lower level £j into the 
energetically higher level £ 2 . In the process, a light quantum of energy 
AE = £2-£i = /? V is removed from the radiation field. 

- Emission occurs from the level £2 spontaneously within a time known as the mean 
lifetime. A light quantum of energy AE is thereby released into the radiation field. 

- Just as light quanta can be absorbed, light quanta in the radiation field can also 
stimulate the emission of further quanta when the atom is in the higher level £ 2 . For 
this stimulated or induced emission, primary light quanta are thus necessary. 
Another light quantum joins those which were already present in the radiation field. 

These processes are shown schematically in Fig. 5.4. 

In order to derive Planck'^ formula, we consider, following Einstein, a system of N 
atoms. The numbers of atoms in the level £1 or £2 are denoted by and A 2 , respec- 
tively. The system is taken to be in thermal equilibrium with its surroundings. Inter- 
actions with the radiation field are only possible in the form of emission or absorption 
of radiation as discrete energy quanta hv = Ei-E^. 



E2,N2 



B,2 



A2I 



B21 



E 



1 / 



Ni 



Fig. 5.5. Derivation of the Planck formula: two energy levels E^ 
and E2 with occupation numbers and N2 are connected by 
transitions with the probabilities B^2^ B21 ^21 



The radiation field is taken to have the radiation energy density u{v,T), which we 
will denote in the following simply as w(v). It then produces, per unit time, the 
following transitions (Fig. 5.5): 

Absorption from 1 to 2. The number of processes in time rfMs proportional to the 
occupation number of level 1 and to the radiation energy density w(v): 
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dNx2 = Bx2u{v)Nxdt . (5.10) 

The proportionality constant Bx 2 is called the Einstein coefficient and is a measure of 
the transition probability per unit time and radiation density. 

Transitions from 2 to 1 are composed of two contributions, as seen in Fig. 5.5: the first 
is spontaneous emission from 2 to 1. The number of such processes per unit time is pro- 
portional to the occupation number N 2 . We have: 



dN^x = A2xN2dt, (5.11) 

A 2 X is also an Einstein coefficient and is a measure for the transition probability per 
unit time. Furthermore, we have induced emission from 2 to 1. It is, analogously to 
(5.10), proportional to the occupation number N 2 and to the radiation density u{v). 
The result is: 



dN2x ~ B2X u{y^N2dt . 



(5.12) 



B 2 X is defined in an analogous way to the Einstein coefficient Bx 2 in (5.10). 

In equilibrium, an equal number of transitions occurs in each direction. We must 
therefore have 



dNx2 = dNt2x + dN^x • (5.13) 

Setting (5.10) and (5.11, 12) equal leads to the following ratio of the occupation 
numbers: 



N,/N , . 

^21 + ^21^(t) 



(5.14) 



Since the system is in thermal equilibrium, the ratio of the occupation numbers of 
the two energy levels can also be calculated according to the Boltzmann distribution. 
It must then be true that 

N2/NX = Q-Ei/kT/^-E,/kT ^ 

From these two equations follows 

^ ^-E,/kT/^-E,/kT 

^21 + ^21^(v) 



and 



u(v) = 



^21 

p p,hv/kT p 

J5x2^ ~^2\ 



(5.17) 



with the abbreviation E 2 -E^ = hv. 

To determine the eoefficients A and B, we use the limiting condition that m(v) must 
go to infinity when oo, i.e., the denominator of (5.17) must go to zero. Then we have 



B\2 — Bix . 



(5.18) 
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From this follows 



m(v) = 



^21 



(5.19) 



Furthermore, the experimentally verified Rayleigh- Jeans law must hold for small fre- 
quencies, i.e., for hv < kT, see (5.5), 



m(v) = 



Snv^kT 



(5.20) 



For small values of the exponent {Hv/kT) we can use a series expansion for the 
exponential function: exp{hv/kT) = 1 + hv/kT+ . . . . Inserting this in (5.19) yields 



w(v) = ^d^L-^ (for hv<kT), 
Bn hv 



(5.21) 



which, combining with the Rayleigh-Jeans law (5.20), leads to 



121 



8 nhv 



3 



B 



(holds generally) . 



12 



(5.22) 



Finally, inserting in (5.19), 



u(v) = 



%nhv^ 






(5.23) 



Equation (5.23) is the Planck formula. 

Rearranging (5.22), we find for the relation between the Einstein coefficients for 
transitions between levels 2 and 1 , 



^21 - 



^nhv^ _ 

5 ^12 • 

C 



(5.24) 



This corresponds to the Kirchhoff relation, according to which the probabilities for 
spontaneous emission and absorption are proportional. 

Equation (5.18) is, furthermore, an expression of the fact that the radiation field 
takes up and gives out radiation quanta in like fashion; absorption and stimulated 
emission are fully complementary physical processes. 

Einstein'^ derivation of the Planck formula lends strong support to the existence of 
light quanta of energy hv. From the equation E = // v and the equivalence of mass and 
energy, E = mc^, it follows that a mass can also be ascribed to the photon, having the 
value mph = hv/c^. However, the rest mass of the photon is in fact zero. 
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5.3 The Photoelectric Effect 



In the year 1888, Hallwachs measured for the first time the laws governing the release 
of electrons from metals by light, the photoelectric effect, following earlier observa- 
tions by Hertz^ The results of his experiments were explained in 1905 by Einstein using 
the hypothesis of light quanta. 

The photoeffect may be simply demonstrated, qualitatively, using the setup shown 
in Fig. 5.6. A zinc plate is activated on its surface by rubbing with mercury and is 
mounted in an electrically insulated holder. If it is negatively charged and then illu- 
minated with ultraviolet light, it rapidly discharges. A positively charged plate cannot 
be discharged by light. 

From these experiments we see that light sets electrons from the plate free. The 
negatively charged plate releases these electrons to the surrounding air; the positively 
charged plate retains them due to Coulomb attraction. These experiments may be made 
more quantitative by replacing the electrometer with a so-called dropping electrometer; 
the quantity of charge released from the plate can then be measured as a function of the 
intensity and energy of the light. 

If one also wishes to measure the kinetic energy of the electrons, the counter-field 
method may be used: one measures the maximum voltage which is just sufficient 



+ 






Fig. 5.6. Photoeffect. Upper part: A negatively 
charged electrometer is discharged upon illumination 
of the electrode, a positively charged one is not. 
Lower part: Arrangement for quantitative study of 
the photoeffect (voltage P, current I) 



Light 







Fig. 5.7 a -d. Quantitative results for the photoeffect, a) Photocurrent 
/ as a function of the frequency v of the light. Below the limiting fre- 
quency Vijjn there is no longer a photocurrent, b) Photocurrent / as a 
function of the applied voltage V. Positive values of the voltage here 
mean that the irradiated electrode is the cathode. The largest negative 
voltage which can still be overcome by the photoelectrons (when the 
irradiated electrode is the anode) is The saturation current /§ is a 
function of the light intensity P. c) Maximum braking voltage as a 
function of the light frequency v; measurement of the ratio h/e and of 
the work function as the slope and intercept of the straight line 
according to (5.28). It should be remembered that in the counter-field 
method, both work functions, that of the cathode and that of the 
anode, are to be taken into account, d) Saturation current as a func- 
tion of the light intensity P. The current increases with the intensity 
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to keep the electrons from leaving the plate. For this purpose, the setup shown in the 
lower part of Fig. 5.6 may be used, with the illuminated electrode however attached to 
the positive pole of the voltage source. 

The results of such an experiment are shown in Fig. 5.7: the electron current / as a 
function of the frequency v of the light begins at a limiting frequency Vii^ which is char- 
acteristic for the electrode material (Fig. 5.7a). The maximum kinetic energy of the 
electrons follows from the current-voltage characteristic curve of the apparatus 
(Fig. 5.7b). If the counter voltage - the braking potential - reaches a certain value 
Fjnax» dependent on the frequency of the light, the photocurrent drops to zero and 
remains so. The electrons which are emitted no longer have sufficient energy to over- 
come the braking potential. The expression = mv^/2 gives the velocity of the 
electrons. If the maximum braking potential F^ax is plotted against the frequency of the 
light, a straight line is found (Fig. 5.7c). 

To understand these experiments, we need the light quantum hypothesis. Classical- 
ly, it would be expected that the electric field E of the light, which is proportional to the 
square root of the light intensity, is responsible for the acceleration and release of the 
electrons from the electrode. The energy of the photoelectrons should increase with in- 
creasing light intensity. We find, however, that the energy of the photoelectrons does 
not depend on the light intensity (and thus on the radiation power), but only on the fre- 
quency of the light. 

On the other hand, the number N of emitted electrons is proportional to the inten- 
sity P of the light (Fig. 5.7 d). 

Photoelectrons are only emitted when the light frequency is larger than a charac- 
teristic value viini which depends on the electrode material. The following relation must 
hold: 

hv^hvxira = eVp,. (5.25) 

Clearly, a part of the light energy /?v is used to release an electron from the metal 
electrode. For this purpose an amount of energy equal to eVp,, called the work 
function, is required. This work function is specific for the electrode material. The 
remainder of the energy of the light quantum is available to the electron as kinetic 
energy. The total energy of the light quantum is thus transferred in an elementary 
process to the electron. The energy balance is given by 

mv^/2 = hv (5.26) 

kinetic energy = quantum energy — work function 

of the photoelectron of the light of the photoelectron 



The kinetic energy of the photoelectrons is equal to the energy e F^ax^ thus we can write 
(5.26) in the form 

eVm^=hv-eV^ (5.27) 

or 

Vm^=hv/e-V^. (5.28) 

The slope of the straight line which is obtained by plotting F^ax against the fre- 
quency V of the exciting light (Fig. 5.7c) can be used for a precision measurement of the 
ratio h/e. For the angle a of the slope we have 
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tdina = h/e, (5.29) 

Table 5.1 gives some examples for the work functions of various metals. The alkali 
metals are notable for their especially small work functions. 



Table 5.1. Work functions eVp^ and limiting wavelengths of several metals 



Metal 


eK^leV] 


hm [nm] 


Li 


2.46 


504 


Na 


2.28 


543 


K 


2.25 


551 


Rb 


2.13 


582 


Cs 


1.94 


639 


Cu 


4.48 


277 


Pt 


5.36 


231 



An arrangement in which the electrons released from an electrode (photocathode) 
complete a circuit, open in the absence of light, between the photocathode and another 
electrode is called a photocell Photocells are used in numerous applications in meas- 
urement and control technology. 

Apart from the so-called external photoeffect, which we have discussed above, the 
same phenomenon is met again in many other areas of physics. In solid state physics, 
the release of normally bound charge carriers by the action of light is called the internal 
photoeffect. In this case, an increase in the electrical conductivity of semiconductors or 
insulators upon illumination may be observed. In nuclear physics, atomic nuclei can be 
excited and caused to emit nucleons, i.e., the particles of which the nucleus is 
composed, upon absorption of very short wavelength radiation (x-rays or gamma 
rays). This is termed the nuclear photoeffect. 



5.4 The Compton Effect 

5.4.1 Experiments 

The Compton effect is the name given to the scattering of light by weakly bound or free 
electrons. This effect occurs in particular in the x-ray region of the electromagnetic 
spectrum. The incident light wave (x-radiation) excites electrons in the target atoms so 
that they oscillate. The oscillating electrons in the field of the positively charged nuclei 
may be considered as classical oscillators; they emit radiation themselves, with the same 
frequency as that with which they are driven. This radiation is called Rayleigh-scattered 
radiation. The theory of Rayleigh scattering was first developed for visible light; it 
explains the blue colour of the sky. Light of short wavelength (blue) is more strongly 
scattered than light of long wavelength (red light). The scattered radiation has the same 
wavelength as the primary radiation, and is polarised. In 1909 it was shown by Barkla 
that this type of scattering also occurs with x-radiation. 

In 1921, Compton observed that in addition to the spectrally unshifted scattered 
radiation, a spectrally displaced component appeared (Fig. 5.8). There is a simple 




5.4 The Compton Effect 



61 



Incident 

x-radiation 



Scattered 

x-radiation 




Fig. 5.8. Compton Effect. Upper part: Experimental setup. 
The x-radiation which is scattered by the target (e.g., a 
graphite block) is measured as a function of the scattering 
angle 6, which is defined to be 0° for undeflected radiation 
and 180° for radiation reflected back towards the source. 
Lower part: Measured scattered radiation as a function of 
wavelength for various scattering angles. The unshifted 
Rayleigh-scattered radiation as well as the shifted Compton- 
scattered radiation are seen 




relation between the shift in the wavelength and the scattering angle: independently of 
the target material, it is found that 

ZIA = Ac(l-cos0) (5.30) 

with the Compton wavelength Ac = 0.024 A. The wavelength shift zl A is also completely 
independent of the primary wavelength. Only the intensity of the Compton scattering 
depends on the target material: it is especially large for light materials (Z small), as 
compared to the x-ray absorption which increases approximately with (18.5). 

Two numerical examples may serve to illustrate Compton scattering; the wave- 
length shift is a maximum at 0= 180°. At an energy of 1000 eV for the light quanta 
before scattering, the energy of the 180° -scattered radiation is 996 eV; at a primary 
energy of 1 MeV, the 180°-scattered radiation has an energy 200 keV. In the former 
case, the energy is reduced by 4 eV or 0.4%, in the latter by 800 keV or 80%. In both 
cases, the corresponding shift in the wavelength is about zl A = 0.050 A. 



Fig. 5.9. Explanation of the Compton effect: the incident x-ray 
quantum with energy E = hv and momentum p = h v/c collides 
with an electron. In the collision, it transfers energy and momen- 
tum to the electron; the scattered x-ray quantum thus has a reduced 
energy hv' and a reduced momentum hv'/c 



E = mc^, p = mv 
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The explanation of these experiments was not possible within the wave picture of 
light. Using the hypothesis of light quanta, the effect can be represented as a collision 
between two particles, a photon and an electron (Fig. 5.9). In the collision, energy and 
momentum are transferred. 

More precisely, we are dealing with an elastic collision between radiation quanta 
and electrons that are weakly bound in the outer shells of atoms with initial velocities 
i?o = 0. The binding energy of the electrons is assumed to be so small that it can be 
neglected in comparison with the photon energy in the following derivation. 



5.4.2 Derivation of the Compton Shift 

We consider the Compton effect as an elastic collision between a photon and an elec- 
tron. Energy and momentum conservation must both be fulfilled. The momentum and 
the kinetic energy of the electron before the collision are practically zero. The calcula- 
tion must be done relativistically, leading to the following equations (see Fig. 5.9). 
Energy before and after the collision is conserved, so that 

hv + moC^= hv' + mc^ , (5.31) 



Here rriQ is the rest mass of the electron and m its mass after the collision, and v and v' 
are the frequencies of the radiation before and after the collision. 

For the momentum in the y direction before and after the collision we have 



0 = 



hv’ 

c 



sind-mv sin 



(5.32) 



and for the momentum in the x direction 



c 



hv' 

c 



cos6+mv COS0 



(5.33) 



In (5.31), we move hv' to the left side and abbreviate v- v' = Zl v. We then express 
the mass of the moving electron in terms of the rest mass, using the relativistic mass 
formula m = m^iX - y If we now square (5.31) and rearrange somewhat, we 

obtain 

.,2 

h^{Av)^^2moC^hAv= (5.34) 

c - V 



In order to eliminate the angle 0 from (5.32) and (5.33), we solve these equations 
for sin0 and cos0 and use the identity sin^0 + cos^0 = 1. When we rearrange, we 
obtain 

,,2 

/?^[(zl v)^+2v(v-Zl v)(l -COS0)] = r- . (5.35) 

c -V 

Since the right-hand sides of (5.34) and (5.35) are identical, we can set the left-hand 
sides equal to one another, obtaining 

m^c^hAv = h^v(y- Av){\ -cos0) . 



(5.36) 
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We wish to express the result in terms of wavelengths instead of frequencies. Using 
c = Av, we obtain 



MA| = 



cAv 

v(v-Av) 



V v-Av 
Inserting (5.37) into (5.36), we have finally 
h 



MAh 



rrioc 



- (1 - cos 6) = Ac(l - cos 9 ) , 



(5.37) 



(5.38) 



where we have introduced the abbreviation k^=h{rriQc)~^ (the “Compton 
wavelength”). Incidentally, the quantum energy of radiation which has the Compton 
wavelength Ac is just equal to the rest energy of an electron: 

hv = hc/Xc = = 511 keV . (5.39) 

The energy and momentum of the recoil electrons may also be calculated with these 
equations. The energy received by the electrons is relatively small, but their paths can 
still be seen in a cloud chamber and be quantitatively determined. This was shown in 
1925 by Compton and Simon, 

Another experiment done by Bothe and Geiger in 1925 shows that the electrons and 
the photons are, in fact, “emitted” simultaneously in the Compton effect (Fig. 5.10). A 
scattering target is set up in the centre between symmetrically placed electron and 
photon detectors. The number of simultaneous counts in the two detectors is measured 
with a coincidence circuit, with the result that the number of coincidences observed is 
far greater than the number which would be expected by chance from uncorrelated 
events. 




Fig. 5.10. Apparatus to detect coincidences between 
scattered x-ray quanta and recoil electrons from the 
Compton effect 



The following remarks may be useful for a deeper understanding of the Compton 
effect: 

- Compton scattering is relatively weak in the case of strongly bound electrons, that is 
in heavy atoms. When the electron’s binding energy is large compared to the 
quantum energy hv of the photon, no momentum transfer is possible. 
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- In certain energy ranges - in particular for medium-hard x-rays - the Compton 

effect is the principal cause of scattering and attenuation of radiation in matter. 

- In Compton scattering, the incident and the scattered radiation are incoherent 

relative to one another hv i^hv’). 

We see that the photoeffect and the Compton effect can only be understood by 
assuming that light (radiation) consists of individual particles with a momentum p. On 
the other hand, we know from interference and diffraction experiments that light 
behaves as a wave, characterised by a wavelength A and a frequency v. The particle 
aspects of light which we summarised at the beginning of this chapter find their con- 
firmation in the photoelectric and Compton effects. 

How can we reconcile the wave and the particle character of the same phenomenon? 
To clarify this question we imagine the following experiment (Fig. 7.5); we allow a 
beam of light to pass through a small hole in an opaque wall and to fall on a screen. On 
this screen we observe, following the laws of wave optics, a diffraction pattern. We 
could, however, detect the light falling on the screen by means of the photoeffect or the 
Compton effect. If we now make the intensity of the light weaker and weaker, we find, 
using the photoeffect as detector, that locally, at particular points, single photoelec- 
trons are set free by the light. The particle character of the radiation is here predom- 
inant. If this experiment is now repeated a number of times and the relative abundance 
of photoelectron releases is kept track of as a function of position, a distribution curve 
will be obtained which is in exact agreement with the diffraction pattern. 

This thought experiment, which could actually be carried out as a real experiment, 
gives us the key to explaining the nature of light. Light carries - virtually, so to speak 

- both properties - wave and particle - within itself. Depending on the experiment 
we carry out, it shows us the one or the other aspect of its character. In order to 
combine both aspects, which at first appear contradictory, we have to apply statistical 
considerations. Thus in the case just described, if we perform an experiment which is 
intended to detect the diffraction pattern, but then inquire about the particle character, 
we find that we cannot predict the point at which the light particle will strike the screen 
with certainty. Instead, we can only give the probability that it will arrive at a particular 
point. The resulting probability distribution is then identical with the diffraction 
pattern which we would calculate according to the laws of classical physics. This 
statistical way of considering things is, as we shall see again and again, fundamental to 
the quantum mechanical interpretation of physical phenomena (Chap. 7.2). 



Problems 

5.1 Express the relativistic mass of a photon in terms of h, A and c. 

5.2 What is the momentum of a photon with 1 eV energy? Give the corresponding 
wavelength in Angstroms. 

5.3 How much mass does a 100 W light bulb lose in one year due to light emission? 

5.4 A photon with 2 MeV of energy is converted into a positron-electron pair. What is 
the kinetic energy of the positron and electron, if the energy is equally distributed 




Problems 



65 



between the two particles and the electrostatic interaction between the two is ignored? 
{rrie- = me+ ^ 0.511 MeV). How large is their velocity? 

5.5 In the upper atmosphere, molecular oxygen is split into two oxygen atoms by 
photons from the sun. The longest wavelength of photons which can do this have 
A = 1.75 X 10“^m. What is the binding energy of O 2 ? 

5.6 A person can perceive yellow light with the naked eye when the power being 
delivered to the retina is l.SxlO’^^W. The wavelength of yellow light is about 
6000 A. At this power, how many photons fall on the retina each second? 

5.7 A monochromatic beam of electromagnetic radiation has an intensity of 1 W/m^. 
What is the average number of photons per m^ and second for a) 1 kHz radio waves 
and b) 10 MeV gamma rays? 

5.8 Calculate the radiation pressure of sunlight when the incoming energy/(sm^) is 
1 .4 X 10^ W/m^ and the radiation is completely absorbed. Compare this value with the 
atmospheric pressure. What is the force on a surface of 1 m^? What is the force if the 
light is completely reflected? 

Hint: The radiation pressure is the momentum transferred per unit of time and surface 
area. 

5.9 A photon which is emitted by an atom imparts an equal and opposite momentum 
to the atom. 

a) What is the kinetic energy transferred to the atom if the frequency of the photon is v 
and the mass of the atom is Ml 

b) How much energy is transferred to the Hg atom in the emission of the mercury 
spectral line A = 2357 A? « 200.6 u). 

c) What is the corresponding reaction energy in the emission of y quanta with 
1.33 MeV energy by ^^i? (Mni « 58.7 u). 

Compare these values with the energy uncertainty due to the lifetime according to 
(7.29)(THg = 10-«s, TNi-10-'"s). 

5.10 What is the temperature of a black sphere with a diameter of 10 cm which is 
emitting a total of 100 W thermal radiation? How much mass is lost each year by being 
radiated away? 

Hint: Use the numerical values from (5.3). 

5.11 Calculate the temperature of the sun and the energy density of the radiation in its 
interior, assuming that the sun is a spherical black body with a radius of 7 x 10^ m. 
The intensity of the solar radiation at the surface of the earth (which is 1.5 x 10^^ m 
from the sun) is 1.4 x 10^ W/m^. Assume that the energy density in the interior of the 
sun is homogeneous. Is this realistic? 

5.12 What is the wavelength of the spectral maximum of the radiation from a black 
body at 300 K (room temperature)? Calculate the monochromatic energy density at this 
frequency. 
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5.13 A photon releases a photoelectron with an energy of 2 eV from a metal which 
has a work function of 2 eV. What is the smallest possible value for the energy of this 
photon? 

5.14 The work function for the photoeffect in potassium is 2.25 eV. For the case 
when light with a wavelength of 3.6 x 10"^ m falls on the potassium, calculate a) the 
braking potential of the photoelectrons and b) the kinetic energy and the velocity 
of the fastest of the emitted electrons. 

5.15 A homogeneous monochromatic light beam, wavelength 4.0 x 10”^ m, falls per- 
pendicularly on a material with a work function of 2.0 eV. The beam intensity is 
3.0 X 10“^ W/m^. Calculate a) the number of electrons emitted per m^and second, b) the 
energy absorbed per m^ and second, and c) the kinetic energy of the photoelectrons. 

5.16 A metal surface is irradiated with light of various wavelengths A. The braking 
potentials F indicated in the table are measured for the photoelectrons. 





K[V] 


A[10“’'m] 


K[V] 


3.66 


1.48 


4.92 


0.62 


4.05 


1.15 


5.46 


0.36 


4.36 


0.93 


5.79 


0.24 



Plot the braking potential along the ordinate versus the frequency of the light along the 
abscissa. Calculate from the curve a) the threshold frequency, b) the photoelectric work 
function of the metal, and c) the quotient h/e. 

5.17 The yellow D lines of sodium appear when sodium vapour is irradiated with elec- 
trons which have been accelerated through a potential difference of 2.11 V. Calculate 
the value of h/e. 

5.18 In a Compton effect experiment, the scattered light quantum is observed at an 
angle of 60° to the direction of the incident light. After the collision, the scattered elec- 
tron moves in a circular path with a radius i? = 1.5cm in a magnetic field 
1^1 = 0.02 Vs/m^ which is perpendicular to the plane of the electron path. What is the 
energy and the wavelength of the incident light quantum? 

Hint: Use (6.7) with IjFI = 0 to calculate the electron’s path. 

5.19 A photon with 10"^ eV energy collides with a free electron at rest and is scattered 
through an angle of 60°. Calculate a) the change in energy, frequency and wavelength 
of the photon, and b) the kinetic energy, momentum and direction of the electron after 
the collision. 

5.20 X-rays with a wavelength of 1 A are scattered on graphite. The scattered radia- 
tion is observed perpendicular to the direction of the incident x-rays. 

a) How large is the Compton shift A XI 

b) How large is the kinetic energy of the ejected electron? 

c) What fraction of its original energy does the photon lose? 

d) How large is the corresponding fraction of energy lost by a photon with a wave- 
length A = 0.1 A if it is deflected through 90° by Compton scattering? 

The electron should be considered at rest before the collision, and the binding energy 
should be neglected. 




Problems 



67 



5.21 A pulsed laser with a repetition rate of 20 Hz produces light pulses with a length 
of 30 ps (picoseconds). Its laser medium is Nd: YAG and the wavelength is 1064 nm. 
The light pulses are passed through a frequency-tripler (efficiency 10%, i.e. only 1/10 
of the incident energy is available at the output of the frequency-tripler). An output 
pulse energy of 1 mJ is required at the output wavelength X (to be calculated). 

a) How many photons n are contained in an output pulse? 

b) How high is the peak power P of the laser system at the output of the tripler? 

c) What is the average light power / of the pulsed laser before the tripler, if only 0.1 % 
of the electrical power is converted into light power? 
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6.1 Production of Free Electrons 

The name “electron”, which is derived from the Greek word for amber, was coined by 
the English physicist Stoney in 1894. 

In the early days of atomic physics, free electrons were usually produced as cathode 
rays from gas dicharges. Now, however, they are most often obtained using thermionic 
emission from wires. This process has the advantage that the electrons can easily be 
focussed and accelerated. Free electrons can also be produced by utilising the photo- 
effect (Chap. 5) or in the form of emissions from radioactive nuclei. 



6.2 Size of the Electron 

The electron is just as invisible as the atom; indeed, as a component of the latter, it 
must be smaller. We first arbitrarily define a parameter, called the classical electron 
radius, by making the following assumptions: 

- the electron is a sphere with radius ^ei and surface charge - e, 

- the energy of the rest mass E = is equal to the potential electrostatic energy of 
the surface charge. 

We use the formulae of classical electrostatics to calculate the electrostatic energy. 
The capacitance of a spherical surface of radius r is 



C = . 



( 6 . 1 ) 



The work required to add a charge ^ to a capacitor with capacitance C is 
W=\q^/C. 



( 6 . 2 ) 



Therefore the potential energy of a spherical capacitor, i.e. the energy of its elec- 
trostatic field, is 



£'pot = ■ 



8 7T£o^ 



(6.3) 



The condition that {m^ is the rest mass of the electron and c is the velocity 

of light) determines the radius r = 



r^\- 



2 • Ansomoc 



(6.4) 
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Other assumptions regarding the distribution of charge (e.g., continuous distribu- 
tion of charge throughout the volume instead of a surface charge) lead to somewhat 
different numerical values. The “classical” electron radius is, finally, defined as 

IS 

/•e, = 5- = 2.8-10“‘^m. (6.5) 

It can be seen from this derivation that the parameter is purely conceptual. Is it 
possible to measure the electron’s radius? In principle, yes. For example, the scattering 
cross section can be determined by irradiating with x-rays, exactly as the scattering 
cross sections of gas atoms may be determined (Chap. 4). The result of such experi- 
ments is a cross section 0 = 11 - and the parameter is found to be of the same 
order of magnitude as the classical electron radius defined above. 

Experiments in which electrons are scattered by electrons reveal no deviations from 
the Coulomb law, even at very small collision distances. All results up to now have thus 
yielded only the information that the electron is a structureless, point-like particle. 



6.3 The Charge of the Electron 

As mentioned earlier, the charge of the electron, - e, can be derived from the Faraday 
constant F\ 



e = F/Np , . (6.6) 

However, since we wish to use this equation to determine Avagadro’s number 
we require an independent method of measuring the elementary charge e. 

This was accomplished by Millikan'^ experiment (1911), in which the charge on 
small drops of oil is determined from their motion in the electric field of a capacitor. It 
remains the best method for determining e. 

The principle of the method is illustrated in Fig. 6.1 . Figure 6.2 shows the entire ex- 
perimental setup. The rising or falling velocity of a charged oil droplet in the homo- 
geneous electric field of a capacitor is determined by the resultant of all the forces 
acting on the droplet: electrostatic force, gravitational force, buoyancy in the air, and 
friction with the air. We will pass over the details of the measurement procedure; we 
mention only, as a curiosity, the fact that an incorrect value of the quantity e was used 
for some years due to an error in the determination of the viscosity of the air. The latter 
quantity is needed to calculate the frictional force acting on the droplet which is rising 
or falling. The best value of the elementary charge is currently e = (1.6021917 
±0.0000070)- 10“ ‘^C. 

The question has been repeatedly raised, whether there are smaller amounts of 
charge than the so-called elementary charge e. Up to the present time, no smaller 
charges have been detected unambiguously. 

To explain the structure of elementary particles, the existence of more fundamental 
elementary particles, the “quarks”, has been suggested; they would have charges of e/3 
or 2e/3. A number of experiments in high energy physics can, in fact, be understood by 
the assumption that such particles exist, but are bound to each other and/or to their anti- 
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Fig. 6.1. Principle of the Millikan oil-drop 
experiment for measuring the charge of the 
electron. A negatively charged oil droplet ex- 
periences the force neE, where n is the 
number of elementary charges on the 
droplet; the graviational force mg acts in the 
opposite direction 



Fig. 6.2. Experimental arrangement of 
Millikan from Phys. Rev. 2, 109 (1913). The 
oil droplets, which are formed by the 
atomiser, can be charged or discharged by ir- 
radiation with x-rays 




droplets 



<5= To pump 




Liquid 
jacket for 
temperature 
control 





particles. On the other hand, there has so far been no convincing proof of the existence 
of free quarks, and theoreticians have even developed a theory of “confinement”. 
According to this theory, the forces between quarks become so large that they can 
never appear as individual particles. 



6.4 The Specific Charge e/m of the Electron 

The mass of the electron is determined by measuring the deflection of electrons in 
electric and magnetic fields. The motion is determined by the ratio of charge to mass 
e/m according to the equation 

F = = - e[E {v xB)] . (6.7) 

dt 

Following the first e/m measurement by Thompson (1897), many methods for measur- 
ing this quantity were developed in the next 50 years, but all of them were basically 
variations of the same principle which we have already discussed for the parabolic 
method in mass spectroscopy. 

The method of Classen (1907) is particularly elegant (Fig. 6.3). TheE field between 
the cathode and the film imparts a uniform, known velocity to the electrons. The 
kinetic energy, to a non-relativistic approximation, is 



( 6 . 8 ) 
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Film strip 
{top view) 



Fig. 6.3. Experimental arrangement for measurement of the specific 
mass m/e of the electron. The electrons are generated at the cathode C 
and accelerated by the voltage V. They are deflected into circular paths 
by a magnetic field perpendicular to the plane of the figure and 
recorded on a film. The direction of the deflection is reversed by 
reversing the poles of the magnet 



where V is the accelerating voltage. Rearranging, 



V = 



lev 



m 



(6.9) 



The B field deflects the electrons into a circular path with radius r. Equating the 
Lorentz and the centrifugal forces yields 



mv 



= evB . 



( 6 . 10 ) 



The desired ratio of charge to mass follows from (6.9) and (6.10): 

e _ IV 
m r^B^ 



( 6 . 11 ) 



One can thus obtain the ratio e/m by measurement of a voltage, a magnetic field 
strength and a distance. Over the years, various other methods were used to measure 
the specific mass m/e. They differ primarily in the relative positions of the electric and 
magnetic fields. Figure 6.4 shows one of these other experimental arrangements, one 
which corresponds in principle to the Aston mass spectrograph. 

The dependence of the mass on the velocity was examined quite early with the help 
of these experiments. Table 6.1 lists some measured values for e/m. 

The limiting value of m as the kinetic energy of the electron went to zero was found 
to be Wo = 1.7588 • 10“^‘kg or (5.485930 ±0.000034) • 10 “■‘u. lu is thus 1822.84 Wq. 

The dependence of the ratio e/m on the particle velocity was found experimentally 
in 1901 (4 years before Einstein'^ theory of relativity) by Kaufmann. Kaufmann used 
the method known in mass spectroscopy as the Thomson parabola method (Figs. 
3.2, 3). The particles are deflected in transverse £* and ^ fields. The electrons studied by 
Kaufmann were f particles from radioactive sources, because his experiments were 
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Table 6.1. Specific charge of the electron at different accelerating 
voltage 



Accelerating 0 


500 


1000 


1500 


voltage V [kV] 








Measured specific 1.76 


0.88 


0.56 


0.44 


charge e/m [10^^ C/kg] 









Fig. 6.4. Arrangement for measurement of the ratio e/m for electrons. Here the electrons are deflected first 
by an electric field and then by a magnetic field 



meant to clarify the physical nature of rays. However, the photographic record of the 
particles did not yield the segment of a parabola which would be expected if the ratio of 
e/m were constant at different velocities of the particles. 

Kaufmann's curves can be understood in the following way: For a uniform value of 
e/m, each point of the parabola correspondings to a particular value of the velocity v. 
From the fact that a parabolic segment is actually observed for lower velocities (larger 
deflections), it can be concluded that the slower particles have a continuous velocity 
distribution at a constant mass. However, at high velocities v the mass appears to 
increase steadily. The curve therefore passes through points on each of a series of 
adjacent parabolas, corresponding to successively higher masses m. 

These measurements were the first to substantiate the dependence of mass on 
velocity. They fit the Lorentz equation 



m = mo 



1 

l/l-yVc^ ’ 



( 6 . 12 ) 



It was later shown that this equation can also be derived from the theory of rela- 
tivity, if the validity and Lorentz invariance of the conservation of energy is assumed. 
The equation is equivalent to the principle E = mc^. Figure 6.5 shows experimental 
values of the dependence of the mass on the velocity. 

The following qualitative argument was advanced in an attempt to understand the 
change in mass with velocity: When the electron is accelerated, part of the energy is 
absorbed in the creation of the magnetic field of the moving electron - which is of 
course an electric current. Thus an “electromagnetic mass” is added to the inertial 




0.4 0.5 0.6 0.7 0.8 0.9 






Fig. 6.5. Experimental values for the mass of the 
electron as a function of its velocity (test of the 
Lorentz formula). The mass m, in units of the rest 
mass aWq, is plotted against the velocity in units of 
the velocity of light, v/c 
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mass. This argument fails, however, because the mass increase is also observed with 
neutral particles, e.g. neutrons. 

As time went on, the Lorentz equation was tested by many highly precise measure- 
ments. It was possible to confirm it to within 1.5% by measurement of electron 
energies under 1 MeV. In modern electron accelerators, it has been confirmed to far 
better precision, as shown in Fig. 6.5, and is now a well established tenet of physics. 

It is instructive to consider the error one would make by neglecting the relativistic 
mass increase. From conservation of energy and (6.12) we find for a kinetic energy 
^kin = 1 keV a velocity v/c = 0.063, and for 1 MeV, v/c = 0.942. The relativistic mass 
increase is then according to (6.12) at 1 keV 4-10"^ times the rest mass mo, but at 
1 MeV it is already 2 times mo, i.e. (m - mo)/mo = 2. 



6.5 Wave Character of Electrons and Other Particles 



The motion of electrons in electric and magnetic fields may initially be understood as 
a particle motion. We have treated it as such in explaining various experiments up to 
now. There are, however, a number of different experiments in which electrons and 
other particles show interference and diffraction phenomena, that is they exhibit wave 
character. In this section, we will discuss the experimental grounds for assuming the 
existence of matter waves. 



Experiment 1: The Ramsauer Effect (1921) 

The measurement of the interaction cross section for collisions of extremely slow elec- 
trons with gas atoms yielded very small values, much smaller than those found in the 
kinetic theory of gases; at somewhat higher electron energies, the values were found to 
be much larger (Fig. 6.6). This type of minimum in the velocity dependence of the inter- 
action cross section could be explained as the result of diffraction by particles whose 
size was comparable with the wavelength of the electrons. 







Fig. 6.6. Scattering coefficient a= no for 
electrons on gas atoms at various electron 
velocities (Ramsauer Effect). The dashed 
line represents the gas kinetic cross section G 
of the atoms. For the relation between scat- 
tering coefficient and cross section see (2.21) 



Experiment 2: The Investigations of Davisson and Germer (1919); 

Their Explanation (1927) 

On reflecting slow electrons from crystals, Davisson and Germer observed interference 
effects, i.e. maxima and minima in the intensities of the reflected electrons, which were 
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Fig. 6.7. Electron interference experiment of Davisson and Germer. Left: 
Diagram of the apparatus. The electrons accelerated through the voltage V 
are reflected from selected surfaces of a single crystal of nickel and record- 
ed as a function of the reflection angle 9. Right: Angular distribution of 
reflected electrons at various accelerating voltages. The diagrams shown 
are polar plots in which the distance from the zero point to the curve corre- 
sponds to the intensity of reflection at the corresponding angle. There is a 
maximum at a scattering or reflection angle of about 50°, due to inter- 
ference 



uniquely determined by the electron velocities, the crystal orientation, and the angle of 
observation. Their experimental setup and the results are shown schematically in 
Fig. 6.7. The interference maxima and minima come about in a manner similar to x-ray 
diffraction from the lattice planes of a crystal (Bragg reflections, see Sect. 2.4.5). The 
occurrence of interference means that the motion of the electrons must be connected 
with a wave phenomenon. Indeed, de Broglie put forth the suggestion that just as light 
can possess a particle character, electrons must also have a wave character; he assumed 
the validity of the fundamental relation p = h/X between the momentum and the wave- 
length. 

If we express the momentum by means of the mass and the velocity, i.e. p = 
and set v = ]/2£'kin/^o for non-relativistic velocities, we find 



X = h/\/2moEan . 



(6.13) 



It follows that for electrons which have been accelerated by a voltage F, 

12 3 o 

X = l±l[A]. 

]/v 

The wavelength is measured in angstroms and the kinetic energy is converted to eV, 
because the electrons acquire their kinetic energy by traversing a voltage V. An 
accelerating voltage of 54 V, for example, produces X = 1.67 A. 

De Broglie'^ hypothesis applies to all particles, not only electrons. The values given 
in Table 6.2 are for electrons. 



Table 6.2. Wavelength of electrons in A corresponding to various energies [eV], according to de Broglie 



^kin [eV] 10 100 10^ 10^ 10^ 10^ 10^ 10^ 

A [A] 3.9 1.2 0.39 0.12 3.7-10“^ 8.7- 10“^ 1.2- 10"^ 1.2- 10"^ 



Many other experiments were suggested and could now be understood. 
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Experiment 3: Fresnel Diffraction from a Sharp Boundary, Boersch (1956) 

One of the basic diffraction experiments in optics is the diffraction by a semi-infinite 
plane. Like light, electrons can be diffracted from a sharp boundary. In this experiment 
they are diffracted from the edge of an extremely thin foil of AI 2 O 3 (Fig. 6 . 8 ). 






Fig. 6.8. Above: Diffraction lines of filtered red light at 
the geometric shadow boundary of a semi-infinite plane. 
Middle: Diffraction of electrons from the edge of an 
AI2O3 foil in a semi-infinite plane. A = 5 • 10“^^ m, cor- 
responding to an electron energy of 3.4 • 10^ eV. Below: 
Diffraction of electrons from a 2 pm thick gold-coated 
wire. Electron energy 1.94- 10"^ eV. All photos from 
R.W. Pohl, Optik und Atomphysik, 11, 12th ed. 
(Springer, Berlin, Heidelberg, New York 1967) Figs. 
202, 522, 523 



Experiment 4: Diffraction from a Fresnel Double Prism, Mollenstedt (1956) 

The Fresnel double prism experiment of classical optics was carried out with electrons. 
In this experiment, an electrically charged quartz fibre acts a double prism for elec- 
trons. Electrons from the two virtual electron sources interfere (Fig. 6.9). Measurement 
of the resulting interference lines in the image plane confirmed the de Broglie rela- 
tionship to within 0.5% (Fig. 6.10). 

Experiment 5: Atoms as Waves (1931) 

The wave character of particles other than electrons was also demonstrated by inter- 
ference experiments. Stern, Frisch and Estermann (1931) observed the diffraction of 
beams of helium atoms from the surface of a LiF crystal. The wavelength of the helium 
atoms is derived from the temperature T = 400 K and the mean kinetic energy 
6 = 3 kT/2 of the atomic beam. It lies in the angstrom range: 

Ahc = h/\/lmE = h/]/3m^J ^ . 

This relationship has also been experimentally confirmed. 

Interference and diffraction with particle beams are now included among the 
routine physical methods. Electrons can be used in the same interference experiments 
as x-rays. 
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Fig. 6.10. Electron interference from the electrostatic double prism according to Mdllenstedt and 
Duker. This image shows the measured intensity distribution on the film in Fig. 6.9; from Gerth- 
sen, Kneser, Vogel: Physik, 13th ed. (Springer, Berlin, Heidelberg, New York 1977) 




Fig. 6.9. Electron interference with an electrostatic double prism according to Mdllenstedt and Diiker. There 
is a voltage between the fibre and the counterelectrodes. The electrons are deflected by the resulting inhomo- 
geneous field as shown. From Gerthsen, Kneser, Vogel: Physik, 13th ed. (Springer, Berlin, Heidelberg, New 
York 1977) Fig. 10.68 




Fig. 6.11. Neutron diffraction from a single crystal, von Laue 
arrangement. With polyenergetic neutrons, one obtains Laue 
diagrams from scattering on the single crystal 




Fig. 6.13. Neutron diffraction from diamond powder, ac- 
cording to G. Bacon. Diffraction maxima occur from four 
families of lattice planes, which are denoted by the crystallo- 
graphic indices (111), (220), (311) and (400) 




Fig. 6.12. Neutron spectrometer 
according to E. V. Wollan, C. G. 
Shull, Phys. Rev. 73, 830 (1948). 
The neutrons are monochromatised 
by reflection from a NaCl crystal. 
When diffracted from a polycrys- 
talline sample, they generate inter- 
ference rings according to Debye- 
Scherrer. They are measured by 
means of a BF 3 counter 
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Neutron diffraction methods have become important in solid state physics. They 
are among the most useful means of determining crystal structures, analysing magne- 
tically ordered systems (due to the existence of a neutron magnetic moment), and detec- 
tion of lattice oscillation spectra in crystals. 

Figure 6.11 shows schematically an arrangement for measurement of neutron dif- 
fraction from single crystals according to von Lane, It is completely analogous to the 
x-ray arrangement (Fig. 2.7). High intensity neutron beams are most conveniently 
obtained from a nuclear reactor. They are produced by nuclear fission with a velocity 
distribution which is continuous, within certain limits. If one wishes to work with par- 
ticles of a single wavelength, or according to de Broglie, with uniform velocity, one 
must monochromatise the neutron beam. 

This can be done, for example as shown in Fig. 6.12, by reflection from a single 
crystal (here NaCl). If these monochromatic neutrons are directed onto a polycrystal- 
line or powdered sample, one obtains the same interference patterns as with the Debye- 
Scherrer x-ray technique (Fig. 2.10). An example of such a pattern is shown in 
Fig. 6.13; it is the result of diffraction of a neutron beam from diamond powder. 



6.6 Interferometry with Atoms 

The wave nature of particles can be employed to construct an interferometer using 
atomic beams, or to set up an imaging system based on interference of atoms, similar 
to those known from photon optics. 

The problems encountered in this atomic interferometry can be demonstrated by 
making use of the quantum-mechanical analog of Young's two-slit experiment, which 
is well-known from classical optics; see Fig. 6.14. A beam of helium atoms is excited 
into the metastable state 2^ So or 2 ^Sq by electron bombardment (the meaning of the 
term symbols is explained in Sect. 17.1). The corresponding de Broglie wavelength of 
the He atoms is found from (6.13) using the velocity of the He atoms to be A = 0.56 A 
at 300 K, and A = 1 .03 A at 77 K. The excited atoms now pass through a slit A of width 
2 |Lim and reach the double slit B. It consists of two slits of width 1 |Lim separated by 
a distance of 8 pm. The atoms which pass through these two slits, i.e. the correspond- 
ing de Broglie waves, interfere in the region behind B, The resulting interference pat- 
tern is registered in the plane C by a photoelectron multiplier, which measures the light 





Fig. 6.14. The experimental setup for carrying out Young *s two-slit interference experiment with atoms (after 
O. Carnal and J. Mlynek, Phys. Rev. Lett. 66, 2689 (1991)). A beam of helium atoms is excited by electron 
bombardment and passes through a slit 2 pm wide in plane A, then reaches plane B containing two slits of 
width 1 pm and spacing 8 pm. The resulting interference pattern behind plane B is observed via a third slit 
in plane C using a secondary electron multiplier (PM) 




Problems 
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Fig. 6.15. The two-slit experiment in atomic interferometry carried out with the ex- 
perimental setup shown in Fig. 6.14: the intensity distribution in plane C. To in- 
crease the attainable resolution, the measurement is performed using a mechanical 
grating which can be moved within plane C and has the “correct” lattice spacing. 
When the slits in the grating coincide with the interference maxima, the measured 
intensity at the electron multiplier PM shows a maximum; it is at a minimum when 
the grating is shifted by one-half its lattice-spacing period. The lattice spacing here 
was 8 |xm 



emitted by the metastable He atoms as they return to their ground states. Figure 6.15 
shows the result of such a measurement. One observes a system of equidistant in- 
terference maxima separated by a distance which corresponds to what one would 
calculate from the geometry of the experiment and the de Broglie wavelength of the 
atomic beam. This provides an impressive proof of the wave nature of atoms and the 
validity of (6.13). 

The biprisma experiment, in which an electrically-charged wire serves as interfero- 
meter (cf. Fig. 6.9), has also been carried out with atoms and was used to determine 
their de Broglie wavelengths [S. Nowak, N. Stuhler, T. Pfau, and J. Mlynek: Phys. 
Rev. Lett. 81 , 5792 (1998)]. 

In this manner, the diffraction of atoms by micromechanical structures such as mir- 
rors, lenses, and beam splitters for matter waves can be employed to produce an 
‘atomic optics’. Image formation has already been demonstrated using a Fresnel zone 
plate, for example (O. Carnal, J. Mlynek et al., Phys. Rev. Lett. 67, 3231 (1991)). The 
way has thus been opened for the construction of microprobes on an atomic scale, with 
which one could, for example, study surfaces with a high spatial resolution, namely 
that of the de Broglie wavelength, and nondestructively. Since the energy of the atoms 
can be made very small, less than 10 meV, little radiation damage would be produced 
on the surface under investigation. The high spatial resolution is made possible by the 
short wavelengths. This area of atom lithography is certainly an interesting field for 
future research. 



Problems 

6.1 In the Millikan experiment to determine the elementary charge, a voltage of 
F = 50 V is applied between the plates of a capacitor, which are 1 cm apart. What must 
the diameter 2r of the oil droplets be, in order that a droplet with a single charge is held 
in suspension? What is the velocity v of the droplets as they fall when the direction of 
the electric field is reversed? The coefficient of viscosity of air is a/ = 1 .84 x 10 poise, 
and the density of the oil g = 0.9 g/cm^ 

Hint: The frictional force Fr is given by Stokes’ Law, F^ = -6nrjrv (1 poise = 
1 dyns/cm^= 10"^ Ns/m^). 
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6.2 A singly charged particle with 0.12 MeV of kinetic energy moves through a 
transverse homogeneous magnetic field with field strength B= 100 G (IG = lO'^^T). 
Calculate the mass of the particle, given that the deflection from the original direction 
of travel is 3 mm in a 10 cm flight path. 

6.3 To measure their specific charge, electrons are accelerated through a voltage V, 
Then they pass through the transverse fields of two small plate capacitors, which are 
placed at a distance / from one another. Both capacitors are connected to a single fre- 
quency generator (frequency v). When the frequency is suitably adjusted, the electrons 
leave the second capacitor in their original direction of travel. 

a) Under what condition is this possible? 

Derive a relation between e/m and the experimental data. 

b) What is the minimum frequency required from the generator if F = 500 V and 
1= 10 cm? 

c) Sketch the apparatus. 

6.4 If the kinetic energy of an electron is equivalent to its rest mass, what is its 
velocity? 

6.5 The rest energy of the electron is 0.511 MeV. Give the ratio of inertial mass to 
rest mass for an electron as a function of its kinetic energy. How large is the ratio for 

^kin=lMeV? 

6.6 Calculate the de Broglie wavelength of an electron with the velocity v = 0.8c, 
using relativistic relationships. 

6.7 Calculate the de Broglie wavelength of an electron with kinetic energy of 1 eV, 
100 eV, 1000 eV, 100 keV. Which wavelengths will be noticeably diffracted in a nickel 
crystal, in which the atomic spacing is about 2.15 A? Calculate the kinetic energy of 
those electrons which are scattered through angles of less than 30°. 

6.8 What is the average kinetic energy and the corresponding de Broglie wavelength 
of thermal neutrons, i.e., neutrons which are in thermal equilibrium with matter at 
25 °C? According to the Bragg formula, what is the angle of incidence at which the first 
interference maximum occurs when these neutrons are reflected from a NaCl crystal in 
which the lattice spacing c/ is 2.82 A? The mass of the neutron is 1.675 x 10"^^ kg. 

6.9 Consider an electron which is far away from a proton and at rest. It is attracted to 
the proton. Calculate the magnitude of the wavelength of the electron when it has 
approached within a) 1 m and b) 0.5 x 10“^^ m of the proton. (The latter distance is of 
the same order as the orbital radius of the electron in the ground state of the hydrogen 
atom.) 




7. Some Basic Properties of Matter Waves 



7.1 Wave Packets 

In the two preceding chapters it was shown that light, electrons and other elementary 
particles can have both wave and particle characteristics. In this chapter we will 
examine more closely how the wave properties of matter can be understood and 
described mathematically. 

For both light and material particles there are basic relationships between energy 
and frequency, and between momentum and wavelength, which are summarised in the 
following formulae: 

Light Matter 

E = hv E = hv= hco 

(7.1) 




Fig. 7.1. Instantaneous view of a wave with 
amplitude Aq and wavelength A 



We now wish to expand these relationships into a more exact theory. We are 
familiar with descriptions of wave motion from the study of light. If we consider a 
plane monochromatic wave (Fig. 7.1) travelling in the x direction, the wave amplitude 
A at time t and point x '\s A (x, /) = A(^cos{kx- cot). The wave number k is related to 
the wavelength A by A: = 2n/X. The circular frequency co is related to the frequency by 
(JO = In V. In many cases it is more useful to use complex notation, in which we express 
the cosine by exponential functions according to the formula 



cosa=|(e‘“+e-'“). 



(7.2) 
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We accordingly expand t): 

A{x,t) = AQ\{Qx^{\kx-\o}t) + Qxx)(-\kx+'\o}t)] . (7.3) 

Applying the relations (7.1), we obtain 



Qx^(\kx-iu>t) = exp 



— (px-Et) 
h 



(7.4) 



The wave represented by (7.4) is an infinitely long wave train. 

On the other hand, since we ordinarily assume that particles (“point masses”) are 
localised, we must consider whether we can, by superposing a sufficient number of 
suitable wave trains, arrive at some spatially concentrated sort of “wave”. We are 
tempted to form what are called wave packets, in which the amplitude is localised in a 
certain region of space. In order to get an idea of how such wave packets can be built 
up, we first imagine that two wave trains of slightly differing frequencies and wave- 
numbers are superposed. We then obtain from the two amplitudes >li(jc, t) andAiix, t) 
a new amplitude A (jc, t) according to 



A{x,t)=Ai(x,t)+A 2 (x,t), (7.5) 

or, using cosine waves of the same amplitude for A^ and A 2 , 

A{x,t)= Ao[c,o%(kxX-(Oxt) + co5{k2X-ou2t)\ . (7.6) 

As we know from elementary mathematics, the right-hand side of (7.6) may be ex- 
pressed as 

2AQCo^{kx- cot) co^{Akx- Acot ) , (7.7) 

where 

k = \(k^ + k2), a> = |(aj, -f ( 02 ) , 

and 

Ak = Yik\ — k2), /d o> = y(£Uj — 0 ) 2 ) . 

The resulting wave is sketched in Fig. 7.2. The wave is clearly amplified in some regions 
of space and attenuated in others. This suggests that we might produce a more and 
more complete localisation by superposing more and more cosine waves. This is, in 
fact, the case. To see how, we use the complex representation. We superpose waves of 
the form (7.4) for various wavenumbers k and assume that the wavenumbers form a 
continuous distribution. Thus, we form the integral 

fcQ + A k 

I aexp[i(kx-a>t)]dk= y/(x,t) , (7.8) 

kQ- Ak 

where a is taken to be a constant amplitude. 




7.1 Wave Packets 
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Fig. 7.2. Superposition of two waves of the 
same amplitude. Fundamental wave 1: 

( ), fundamental wave 2: ( ) same 

amplitude as 1. Resulting wave A: ( ). 

The envelope cos {Akx- Acot) for constant t 
is also shown as a dashed curve 



X 



In taking this integral, we must notice that co and k are related to one another, since 
the energy and the momentum of an electron are connected by the relation 
E = p^/{2mo), and this in turn means that cu and k are related according to (7.1). To 
evaluate the integral we set 



k = k()-\- (k — k{)) (7*9) 

and expand a> about the value k^ using a Taylor series in {k-k^), which we terminate 
after the second term: 



CJ — 



dco 



{k-ko) + ... 



(7.10) 



In the following, we abbreviate dco/dk as a>'. Inserting (7.9) and (7.10) in (7.8), we 
obtain 

Ak 

v/(x,t) = aexp[-i(coof-^o^)] I exp[-i(co't-x)<J]rf(J, (7.11) 

-Ak 

where we have set (k-ko) = The remaining integral may be evaluated in an elemen- 
tary manner and (7.11) finally takes the form 



t) = aexp(-icuot-t-i/to^) • 2 



sin [(o}' t-x)Ak] 
to' t-x 



(7.12) 



The real part of y/ is shown in Fig. 7.3. 

We can draw two important conclusions from (7.12): 

1) The wave packet represented by y/ is strongly localised in the region of x = cu' t. 
The maximum amplitude moves with a velocity cu' = dco/dk. With the help of (7.1), 
we can express cu and k in terms of E and p, obtaining cu' = dE/dp, or, if we use the 
standard relation E = p^/2mo, finally cu' = p/rriQ= fparticie- In order to understand this 
result, we recall the concepts of phase velocity and group velocity. 
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Fig. 7.3. The real part of \j/{x, t) as a func- 
tion of the position x. The rapid oscillations 
are described by cos(/:oX- oj^t) with t fixed. 
The envelope is given by ^\\\{{o)' t - x) A k]/ 
{oj' t-x) with t fixed. Note that the scale of 
the X axis has been greatly reduced in com- 
parison to Fig. 7.2 



X 



If we let the time variable increase in the wavefunction co^{kx- cot), then the 
position Xjnax at which a particular wave maximum is to be found moves according to 
the relation cot = 0, i.e. = {oj/k) t. The position x^ax thus moves with the 

phase velocity Fp^ase = 

If we replace cohy E and khy p according to (7.1), we find immediately that this v 
does not equal the particle velocity. On the other hand, we have just seen that the 
maximum of a wavepacket moves with the velocity Vq = doj/dk. This velocity of a 
wave group (wavepacket) is called the group velocity. Thus the group velocity of the 
de Broglie waves (matter waves) is identical with the particle velocity. 

We could be tempted to unify the wave and particle pictures by using wave packets 
to describe the motion of particles. This is unfortunately not possible, because in 
general, wave packets change their shapes and flow apart with time. We are therefore 
compelled to adopt a quite different approach, as will be shown below. 

2) A second implication of the result (7.12) is the following: The width of a wave 
packet is roughly the distance between the first two zero points to the left and right of 




Fig. 7.4. The envelope of the real 
part of the wave packet (7.12) (Fig. 
7.3). The first node is at Vq = n/ Ak 
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the maximum (Fig. 7.4). Since the first zero point is at Xq= n/Ak, the width of the 
wave packet would be zlx = 2 n/ Ak, The more we wish to concentrate the wave packet, 
i.e., the smaller we make Ax, the larger we must make the /r-region, or Ak. 

In order to clarify the relationship between the particle and the wave descriptions, 
we shall consider the experiment described in the following section as we have already 
for light. 



7.2 Probabilistic Interpretation 

We wish to illustrate, using the electron as an example, how one can unify the wave and 
particle descriptions. To determine the position of an electron in the x direction 
(Fig. 7.5), we allow an electron beam to pass through a slit with a width Ax. We can 
thus ensure that the electron coming from the left must have passed through this 
position. Now, however, the wave properties come into play, and the electron is 
accordingly diffracted by the slit. A diffraction pattern is produced on the screen S 
(Fig. 7.5). According to wave theory, the intensity of the diffraction pattern is propor- 
tional to the square of the amplitude. When we consider the electron as a wave, and 
take (// as its wave amplitude, we obtain the intensity I = \ |//(x, t) f at time t and 
position X on the observation screen. It is better, for both mathematical and physical 
reasons, not to speak of the intensity at a point in space, but rather of the intensity in 
the three-dimensional region dx, dy, dz around the point x, y, z. 
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Fig. 7.5. An electron beam (arrows at left) 
passes through an aperture and generates a 
diffraction pattern on a screen. The intensity 
distribution on the screen is shown schemat- 
ically on the right 



Therefore, in the following we shall consider the intensity in a volume element 
dV = dxdydz: 

Idxdydz= \ y/{x,y,z,t)\^dxdydz. (7.13) 



(Compare this to the one-dimensional example in Fig. 7.6.) 
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Fig. 7.6. |(//(x)|2 as a function of x at a given time t. The shaded area corresponds to the probability that the 
electron is located in the interval Xq to Xq + dx 



Now comes the essential point. The screen can also be considered as an apparatus 
which detects electrons individually as particles. A fluorescent screen flashes at the 
point of impact each time an electron hits it. The electron is thus highly localised, and 
there is no diffraction pattern. If we repeat the experiment, we observe other flashes of 
light, and in general these are at different points on the screen. Only when we carry out 
many experiments, or allow many electrons to pass through, do we obtain a diffraction 
pattern of the form described by (7.13). This is the key to the explanation of the phe- 
nomenon of “wave-particle duality”. On the one hand, the intensity of the diffraction 
pattern in a volume AVi^ proportional to the absolute square of the amplitude, 

W?^V, (7.14) 

and on the other, it is proportional to the probability of finding the electron in A F. 
\\f/\^AV is thus itself proportional to the frequency of finding the electron in AV. 

I y/{x,y,z, t) l^dxdydz must therefore be seen as probability of finding the electron 
in a volume element dV about the point x,y,z- 

Because the statistical interpretation of quantum mechanics will be mentioned fre- 
quently, and is absolutely necessary to an understanding of the subject, we shall spend 
a bit more time on the concept of probability. Let us compare a quantum mechanical 
experiment with a game of dice. Since a die has six different numbers on its faces, it 
has, so to speak, six different experimental values. We cannot say in advance, however, 
which face, i.e. which experimental value we will obtain in any given throw. We can 
only give the probability of obtaining the value n. In the case of a die, P„ is very easy 
to determine. According to a basic postulate of probability theory, the sum of all prob- 
abilities Pn must be one (i.e., one face must come up on each throw): 

(7.15) 

n 



Since all the numbers « = 1, 2, ... 6 are equally probable, the six values of must be 
equal, so = 1/6. 
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It is not so easy to determine \ ij/^dxdydz. We can infer from the above, however, 
that there must be a normalisation condition for \ {i/'^dxdydz- If we integrate over all 
points in space, the particle must be found somewhere, so the total probability must 
therefore be equal to 1. We thereby obtain the basic normalisation condition 

\\\!/(x,y,z)fdxdydz = \. (7.16) 

We shall illustrate the use of this normalisation condition with two examples. 

1) We assume that the electron is enclosed in a box with volume V. The integral 
(7.16) must then extend only over this volume. If we use for \j/ the wavefunction 

yj = AQtxp(\k • x-\a>t) , (7.17) 

where k x = kxX + kyy + k^z, then Ag must be 

Ag=V~'^^. (7.18) 

2) If the space extends to infinity, there is a difficulty, because here ^ if we 
simply allow V to go to infinity in (7.18). It can be shown, however, that a generalised 
normalisation condition can still be derived. In one dimension the normalised wave- 
function is 

i//k(x,t) = (i/\/2n) Qxp{ikx-\ojt ) , (7.19) 

and the normalisation condition is 

\y/iix,t)y/kixA)dx = d(k-k') . (7.20) 

Here (?(/:- A:') is the Dirac S function (see Appendix A). 

The probabilistic interpretation of the wavefunction is also necessary for the fol- 
lowing reason: if the impact of an electron on the screen were to cause it to flash at 
more than one point, this would mean that the electron had divided itself. All experi- 
ments have shown, however, that the electron is not divisible. The determination of 
\\f/\^dV allows us only to predict the probability of finding the electron in that volume. 
If we have found it at one position (localised it), we are certain that it is not somewhere 
else as well. This is evidently a “yes-no” statement and leaves no ambiguity for an 
individual electron. If we consider the reflection of electrons in this way, and observe 
that 5% are reflected, it means this: if we carry out a very large number of experiments, 
5% of all the electrons would be reflected. It would be completely false, however, to 
say that 5% of a single electron had been reflected. 



7.3 The Heisenberg Uncertainty Relation 

We now consider some of the implications of the fact that the electron sometimes acts 
as a particle and sometimes as a wave. As we calculated earlier, the one-dimensional 
distribution of the wave packet is 

, ^ sin(xAA:) 
y/{x) ~ 



(7.21) 
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If we take the position of the first zero point as a measure of the uncertainty in the 
position, we obtain from (7.21) (Fig. 7.4) the relation 

— = — . (7.22) 

2 Ak 

The uncertainty in the position is clearly connected with the uncertainty in the wave- 
numbers k. But the wavenumber is related to the momentum by the equation 

p = hk. (7.23) 

If we insert this in (7.22), we obtain the basic Heisenberg uncertainty relation 



AxAp^h (7.24) 

(a mathematically precise formulation and derivation can be found in Appendix C). 
This relation states that it is impossible to measure the position and the momentum of 
an electron exactly at the same time. A lower bound to the simultaneous measurability 
is given by (7.24). Indeed, if we wished to let Ax go to zero (exact determination of the 
position), we would have to allow Ap to become infinite, and vice versa. The fact that 
we notice nothing of this uncertainty relation in daily life is a result of the smallness of 
Planck’s constant h. If, on the other hand, we consider the microscopic world, then we 
can only understand the results of experiments if we take the finite size of the constant 
h into account. We will clarify the meaning of (7.24) with the example of an experi- 
ment. 

An electron is moving in a horizontal direction (y). We wish to determine its co- 
ordinate in the perpendicular (x) direction. For this purpose, we set up a collimator per- 
pendicular to the direction of motion with a slit of width d = Ax. If the electron passes 
through this slit, then we know that it was at that position with the uncertainty Ax. 
Now, however, we must take into account the wave nature of the electron. From the 
theory of diffraction we know that a wave produces a diffraction pattern on the obser- 



1 



X 




y 



Fig. 7.7. Diffraction of a wave 
by a slit 
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vation screen after passing through a slit of width d (Fig. 7.7). The angle 0 at which the 
first diffraction minimum occurs is given by 

sin0 = A/rf. (7.25) 

If we denote the total momentum of the electron by p, the projection of the momentum 
on the X axis is p sin0. This x component in the momentum is produced by diffraction 
of the electron wave at the slit; the resulting uncertainty in the momentum is then 

AlPx = Psin0. (7.26) 

If we once again use the relation 

p = h/k (7.27) 

and insert (7.26) and (7.27) in (7.25), we again obtain the Heisenberg uncertainty 
relation (7.24). 

This example shows clearly that a measurement of one quantity, here the position, 
immediately produces a perturbation of the complementary quantity, namely the 
momentum. Before we set up the collimator with its slit, we could have determined the 
momentum of the electron. The result would have been that the electron was moving 
exactly in the y direction, i.e., that its momentum component in the x direction was 
exactly equal to zero. In the above experiment, we were able to determine the position 
with a certain accuracy, but we had to accept the fact that the momentum thereby 
became uncertain in the x direction. There is also a relation between energy and time 
which is analogous to (7.24). 



7.4 The Energy-Time Uncertainty Relation 

In the wavefunction ~exp(i/:x- i which was the starting point of this chapter, the 
position X and the time t occur in a symmetric fashion. Just as we could form wave 
packets which exhibited a certain concentration in space, we can also construct wave 
packets which have a concentration about a time t with an uncertainty A t. Instead of 
the relation AxAk we then have 

AtAco^ln. (7.28) 

Utilising the relation E = hco, we find from this that 

AEAt^h, (7.29) 

This relation, which we shall discuss in more detail at a later point in the book, states 
among other things, that one must carry out a measurement for a sufficiently long 
time, in order to measure an energy with good accuracy in quantum mechanics. 
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7.5 Some Consequences of the Uncertainty Relations for Bound States 



In the preceding sections of this chapter we have explicitly considered free electrons. In 
the next chapters we shall be concerned with the experimental and theoretical questions 
associated with bound electrons, for example in the hydrogen atom. In this section we 
shall to some extent anticipate the presentation in the rest of the book. The reader will 
recognise even in this section that wave mechanics will play a fundamental role in the 
treatment of bound states. 

We will consider the hydrogen atom as the simplest case of a bound state. We 
assume that the electron travels around the nucleus in an orbit, as a planet around the 
sun. Why the electronic shells of the atom have a finite extent - why there is a smallest 
electron orbit - was an insoluble problem in classical physics. 

The energy of an electron is equal to the sum of the kinetic and the potential energy. 



^class '^kin“^^pot* (7.30) 

If we express the kinetic energy of a particle £'kin = (mo/2)y^ in terms of the 
momentum p, and substitute the Coulomb potential energy - e^/iAne^r) for Epot, the 
expression for E is 



E = 







(7.31) 



Here r is the distance of the electron from the nucleus. 

It can be shown in classical mechanics that £’dass = - pV(2 • An 

If we allow r to go to zero, the energy naturally goes to - oo. In other words, the 
energy decreases continually and there is no smallest orbital radius. Let us now 
consider the expression (7.31) from a “naive” quantum mechanical point of view. Then 
“orbit” would mean that we have the electron concentrated at a distance of approxi- 
mately r from the nucleus. The positional uncertainty would therefore be of the order 
of r. This, however, would entail uncertainty in the momentum p of the order of h/r, 
which in turn establishes a minimum for the order of magnitude of the kinetic energy 
(Fig. 7.8). If we therefore substitute 

/?«— (7.32) 

r 



in (7.31), we realise that the minimum of the energy expression 



1 h^ 



2 mo 



= Min 

47reoA‘ 



(7.33) 



is no longer at r= 0. If we let r go to zero, the kinetic energy would increase very 
rapidly. We shall leave the determination of the minimum of (7.33) to the reader as an 
simple exercise in differential calculus and give the result immediately. The radius is 

h^4nsQ 
moe^ 



r = 



(7.34) 
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Fig. 7.8. This figure is meant to clarify the competition between kinetic and 
potential energies as a function of the distance r <->• position uncertainty in the 
hydrogen atom, based on the Heisenberg uncertainty relation. ( ) poten- 
tial energy, ( ) kinetic energy, ( ) total energy = sum of potential and 

kinetic energies. An energy minimum is seen to result 
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If we substitute this r in (7.33), the corresponding energy is 



1 e^nio 

J {4nsofh^ ' 



(7.35) 



When we substitute the known numerical values for Planck’s constant, and the mass 
and charge of the electron, we obtain a radius of about 10“^ cm, which is the right 
order of magnitude for the hydrogen atom. As we shall see later, the exact quantum 
mechanical calculation of the energy yields 



4 4 

1 e rriQ 
Y i4nsofn^ ' 



(7.36) 



The only difference between (7.35) and (7.36) is the factor = {h/2n)^ which re- 
places h^. 

The Heisenberg uncertainty principle also allows us to calculate the so-called zero- 
point energy of a harmonic oscillator. Here we consider the motion of a particle elas- 
tically bound by a spring with a spring constant /. Since the elastic energy increases 
quadratically with the displacement x and the kinetic energy again has the form 
p^/2mo, the total energy is 



E = 



f 2 

2 mo 2 



(7.37) 
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Fig. 7.9. Illustrating the competition between 
kinetic and potential energies as a function of 
the displacement position uncertainty in the 

harmonic oscillator. ( ) potential energy, 

( ) kinetic energy, ( ) total energy. The 

classical energy minimum at Xq = 0 is shifted to a 
finite value 



In classical physics this energy is at a minimum when both the momentum and the 
position are zero, i.e., the particle is at rest. However, sinee according to the Heisen- 
berg relation an exact position is associated with an infinite momentum, we allow a 
positional uncertainty of the same magnitude as the oscillation amplitude Xq and have 
the corresponding momentum uncertainty according to (7.24), where Xq assumes the 
role of r (Fig. 7.9). We again require that the total energy is minimised by the appro- 
priate choice of Xq: 



E = 



2moXo 



/.2 



Xq = Min . 



(7.38) 



Solving this equation for Xq yields the amplitude of the harmonic oscillator. 



4 



^0 = 



mof 



(7.39) 



The energy then has the form 



E = hoj. (7.40) 

As we shall see later, an exact quantum mechanical calculation yields the relations 

E=\h(o (7.41) 

and 




2moco 



(7.42) 




Problems 
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It follows from these considerations that atomic, elastically bound particles are 
fundamentally incapable of being at rest. Such elastically coupled particles occur, for 
example, in crystal lattices. The quantum theory predicts that these atoms will con- 
stantly carry out zero-point oscillations. 



Problems 

7.1 Normalise the wave packet 

y/(x,t) = NTexp ^ 

-» [ 2{Ak)^ 

for / = 0. Then calculate y/{x, t) for a free particle of mass rriQ for ^ > 0. Does the nor- 
malisation hold for ^ > 0? On the basis of the occupation probability, decide whether 
the wave packet falls apart. What is the significance of 




Hint: Use the relation 

— 00 — 00 

(completing the square ! ) 

The second integral can be converted to the Gaussian integral by changing the 
coordinates. 

7.2 By the appropriate choice oi Akm Problem 7. 1 , let the probability of locating the 
wave packet outside Ax= 10“^ cm be zero. How long would it take Ax to attain the 
size of the distance between the earth and sun ( — 150 million km)? 

Hint: Choose Ax so that \f/{Ax,0) = 1/e [e = exp(l)!] 

7.3 Treat Problems 7.1 and 7.2 in three dimensions. 




8. Bohr’s Model of the Hydrogen Atom 



8.1 Basic Principles of Spectroscopy 

In the following chapters we shall take up the detailed analysis of the spectra of atoms 
in every wavelength region. The most important sources of information about the elec- 
tronic structure and composition of atoms are spectra in the visible, infrared, ultra- 
violet, x-ray, microwave and radio frequency ranges. Figure 8.1 summarises these 
spectral regions. 

Optical spectra are further categorised as line, band and continuous spectra. Con- 
tinuous spectra are emitted by radiant solids or high-density gases. Band spectra consist 
of groups of large numbers of spectral lines which are very close to one another. They 
are generally associated with molecules. Line spectra, on the other hand, are typical of 
atoms. They consist of single lines, which can be ordered in characteristic series. 



Wave- Frequency Spectral Photon Wave- 

number region energy length 

V [cm‘^] E [eV] A [cm] A [A] 




Fig. 8.1. The electromagnetic 
spectrum. Regions and units 
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Optical spectra can be observed either by emission or by absorption. The former 
mode requires that the substance to be examined be made to emit light; this can be 
achieved by transferring energy to the atoms by means of light, electron collisions, 
x-ray excitation or some other process. If a substance re-emits the light it has absorbed, 
the process is called resonance fluorescence. The best known example of this is the 
resonance fluorescence of sodium vapour (Fig. 8.2). 



Resonance light 




Fig. 8.2. Resonance fluorescence of 
sodium vapour. Sodium metal is 
heated in an evacuated glass sphere. 
The resulting sodium vapour absorbs 
the light of a sodium vapour lamp 
and emits the same light as resonance 
fluorescence in all directions 



Details of apparatus will be discussed in the following sections in connection with 
particular problems. 

Spectra are traditionally measured in several different units, due to the features of 
the apparatus or for practical reasons: 

- In wavelength units. These can be determined absolutely with a diffraction grating. 
Usually, however, one uses a calibrated comparison spectrum, which allows greater 
accuracy. 

One wavelength standard is the yellow ^^Kr line, that is a yellow line in the spectrum 
of the ^^Kr atom. For this line, 

Avac = 6057.80211 A v = 16507.6373 cm“^ (see below) . 

In general the wavelengths are referred to vacuum. The corresponding wavelength 
in air is somewhat smaller, because the index of refraction of air is somewhat greater 
than 1, and the velocity of light in air is thus somewhat less than in a vacuum. To 
convert wavelengths measured in air (“normal” air, 15 °C, 760Torr), the formula is 



The refractive index of air is a function of the wavelength. At 6000 A, n = 1.0002762. 
For the yellow ^^Kr line in normal air, 

Aair = 6056.12941 A. 

- Specifying the frequency is more general, since it is not dependent on the medium. 
We have: 



V ^AAyac c/(AlAair) • 
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- A frequently cited quantity is the wavenumber. 

v= v/c= l/(«Aair) . 

The wavenumber is, like the frequency, a quantity proportional to the energy; con- 
version may be made according to the equation 

E = vhc . 

- Finally, the unit electron volt (eV) is often used as a measure of the energy. 
Several units which are important and practical in atomic physics as well as conver- 
sion factors are set out in Table 8.1 and in Fig. 8.1. 



Table 8.1. Frequently used units and conversion factors (see also the table on the inner side of the front 
cover) 



Quantity 


Unit and conversion factor 


Wavelength A 


1 A = 10"'®m = 0.1 nm 


Wavenumber v 


1 cm “ ^ ( = 1 kayser) 


V- 1/A 


v = 8066 £(eV) cm"’ 

1 cm"' =29.979 GHz 


Energy E 


1 electron volt = 1.602 • 10“^^ J = 1.96 • 10“^ 
E = hv = hc/X = hcv 
1 eV ^ 2.418 • 10^4 Hz ^ 8066 cm~^ 

E(eV) = 1.24-10"‘' — 
cm 


Mass Wq 


1 electron mass = 9.11 • 10"^' kg = 511 keV/c^ 


Charge e 


1 elementary charge = 1.6 ■ 10"’^ C 


Planck’s constant h 


/, = 4.14 - 10"’^ eV s 
^=ft/27T = 6.58-10"'^eVs 



8.2 The Optical Spectrum of the Hydrogen Atom 



Kirchhoff 2 ind Bunsen, the founders of spectroscopic analysis, were the first to discover 
in the mid- 19th century that each element possesses its own characteristic spectrum. 
Hydrogen is the lightest element, and the hydrogen atom is the simplest atom, con- 
sisting of a proton and an electron. The spectra of the hydrogen atom have played an 
important role again and again over the last 90 years in the development of our under- 
standing of the laws of atomic structure and of the structure of matter. 

The emission spectrum of atomic hydrogen shows three characteristic lines in the 
visible region at 6563, 4861, and 4340 A (H^ ^;). The most intense of these lines was 
discovered in 1853 by Angstrom; it is now called the line. In the near ultraviolet 
region, these three lines are followed by a whole series of further lines, which fall closer 
and closer together in a regular way as they approach a short- wavelength limit (Ho>) 
(Fig. 8.3). 

Balmer found in 1885 that the wavelengths of these lines could be extremely well 
reproduced by a relation of the form 



A- 



n\ 



n\-A 



G. 



( 8 . 1 ) 
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Hp 

4861.3 A 



T~ 



— I 

Ha 

6562.8 A 



Hy He 
4340.5 A 4101.7 A 



Fig. 8.3. Balmer series in the hydro- 
gen emission spectrum. The conver- 
gence of the lines to the series limit 
Hqo is clearly seen 



Here is an integer, = 3, 4, . . . and G is an empirical constant. Today, we write the 
Balmer formula somewhat differently. For the wavenumbers of the lines we write 



V 



l/X = Rii 



1 

¥ 



1 



n an integer > 2 



( 8 . 2 ) 



The quantity /?h( = 4/G) is called the Rydberg constant and has the numerical value 
/?H= 109677.5810 cm“‘. 

The series limit is found for n -» oo to be 
Voo = 7 ?h/4 • 

For the further investigation of the hydrogen spectrum, astrophysical observations 
have played an important role. In the spectra of stars, photographically recorded as 
early as 1881 by Huggins, a large number of lines from the hydrogen spectrum are seen. 



Table 8.2. The first 20 lines of the Balmer series of hydrogen. The numbers quoted are wavelengths in air, the 
wavenumbers in vacuum, and the values calculated from the Balmer formula 



n 


^air [A] 


Vvac [C™ '] 




3 


6562.79 


15233.21 


15233.00 


4 


4861.33 


20564.77 


20564.55 


hJ 5 


4340.46 


23032.54 


23032.29 


6 


4101.73 


24373.07 


24372.80 


H. 7 


3970.07 


25181.33 


25181.08 


8 


3889.06 


25705.84 


25705.68 


H, 9 


3835.40 


26065.53 


26065.35 


H.,10 


3797.91 


26322.80 


26322.62 


H, 11 


3770.63 


26513.21 


26512.97 


H,,12 


3750.15 


26658.01 


26657.75 


Ha 13 


3734.37 


26770.65 


26770.42 


H^14 


3721.95 


26860.01 


26859.82 


H,15 


3711.98 


26932.14 


26931.94 


H^16 


3703.86 


26991.18 


26990.97 


H„17 


3697.15 


27040.17 


27039.89 


H„18 


3691.55 


27081.18 


27080.88 


H^19 


3686.83 


27115.85 


27115.58 


H„20 


3682.82 


27145.37 


27145.20 




8.2 The Optical Spectrum of the Hydrogen Atom 
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Using modern radio-astronomical techniques, transitions between states with extremely 
large n-values have been found; levels with n between 90 and 350 could be identified. 

The reason that many lines were discovered first in astrophysical observations and 
not by experiments on the earth is connected with the difficulty of preparing pure 
atomic hydrogen in the laboratory. Gas discharges, in which H 2 gas is decomposed into 
atomic hydrogen and excited to fluorescence, always contain fluorescing hydrogen 
molecules as well, whose spectrum overlaps the atomic-hydrogen spectrum. 

Above the series limit we observe the so-called series-limit continuum, a region in 
which the spectrum shows no more lines, but is, instead, continuous. 

A comparison of the calculated spectral lines obtained from the Balmer formula 
(8.2) with the observed lines (Table 8.2) shows that the formula is not just a good 
approximation: the series is described with great precision. The whole spectrum of the 
H atom is represented by equations of the form 



v = /?H — T ~ — r with/?' < n being integers . (8.3) 

\n n ) 

The numbers n and n' are called principal quantum numbers. Table 8.3 contains some 
of the lines from the first four series. 



Table 8.3. The wavelengths of some lines of the various spectral series in hydrogen. The series with n' = 5 
was observed in 1924 by Pfund\ it begins with a line of A = 74000 A, but is not shown in the table 





1 


2 


3 


4 


n 


Lyman 


Balmer 


Paschen 


Brackett 


2 


1216 A 










^ 82257 cm“' 








3 


1026 A 


6563 A 








^ 97466 cm-’ 


^15233 cm-’ 






4 


973 A 


4861 A 


18751 A 






^102807 cm-’ 


^20565 cm-’ 


^ 5333 cm-’ 




5 


950 A 


4340 A 


12818 A 


40500 A 




^105263 cm-’ 


^23033 cm-’ 


^ 7801 cm-’ 


A 2467 cm~^ 


Year of 


1906 


1885 


1908 


1922 


discovery 











The relation (8.3) was formulated first by Rydberg in 1889. He found, “to his great 
joy”, that the Balmer formula (8.1) is a special case of the Rydberg formula (8.3). 
Table 8.3 also illustrates the Ritz Combination Principle, which was found empirically 
in 1898. It states: 

The difference of the frequencies of two lines in a spectral series is equal to the fre- 
quency of a spectral line which actually occurs in another series from the same atomic 
spectrum. For example, the frequency difference of the first two terms in the Lyman 
series is equal to the frequency of the first line of the Balmer series, as can be seen from 
the wavenumber entries in Table 8.3. 
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Fig. 8.4. a) Term diagram of the lines of the hydrogen spectrum 
and series classification. The wavelengths of the transitions are 
given in A. The energies can be given as (negative) binding energies, 
with the zero of energy being the ionisation limit, or they can be 
given as excitation energies, beginning with the ground state, so 
that the energy of the term n^o is equal to the ionisation energy 



b) This represents the lines of the hydrogen spectrum in the term 
scheme of Grotrian [Struktur der Materie VII (Springer, Berlin 
1928)]. The symbols / and k appearing in the upper margin of the 
figure will be explained later (Sect. 8.9) 



We can conclude from observation and inductive reasoning that the frequencies (or 
wavenumbers) of all the spectral lines can be represented as differences of two terms of 
the form R/ n As we shall see in the following, these are just the energy levels of the 
electron in a hydrogen atom. The spectral lines of the hydrogen atom can be graphical- 
ly pictured as transitions between the energy levels (terms), leading to a spectral energy 
level diagram (Fig. 8.4). 

8.3 Bohr’s Postulates 

In the early years of this century, various models were suggested to explain the relation- 
ship between atomic structure and the spectral lines. The most successful of these is due 
to Bohr (1913). Following the Rutherford model, he assumed that the electrons move 
around the nucleus in circular orbits of radius r with velocity v, much as the planets 
move around the sun in the Solar System. A dynamic equilibrium between the cen- 



Wavenumber V [cm 
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trifugal force and the Coulomb attraction of the electrons to the nucleus is assumed to 
exist. Thus, for the hydrogen atom, one has 

=morco\ (8.4) 

Aneor 

The corresponding energy is the sum of the kinetic and the potential energies of the 
electrons: 



E = 



•fi'kin + E 



pot > 



where the kinetic energy, as usual, is given by m^v^/l or rriQr^ a>^/2. The potential 
energy is defined as the work which one obtains on allowing the electron to approach 
the nucleus under the influence of the Coulomb force from infinity to a distance r. 
Since the work is defined as the product of force and distance, and the Coulomb force 
changes continuously with the distance from the nucleus, we must integrate the contri- 
butions to the work along a differential path dr\ this gives 



r 




oo 



4n8or'^ 



dr' = 



4n6or 



(8.5) 



£'pot, as a binding energy, may be seen to be negative, with the zero point being the state 
of complete ionisation. The total energy is thus found to be 

E = — mor^co^- — - — . (8.6) 

2 4n8or 



Thus far, the model corresponds to that of Rutherford. 
We may rewrite (8.6) by using (8.4): 



E= - 



1 



2 • 47^^o'* 



2(4 7T£o) 



2/3 






(8.7) 



If, however, one attempts to understand the emission and absorption of light using this 
model and the known laws of classical electrodynamics, one encounters fundamental 
difficulties. Classically, orbits of arbitrary radius and thus a continuous series of 
energy values for the electron in the field of the nucleus should be allowed. But on 
identifying the energy levels which are implied by the spectral series with the values of 
the electron’s energy, one is forced to assume that only discrete energy values are 
possible. Furthermore, electrons moving in circular orbits are accelerated charges, and 
as such, they should radiate electromagnetic waves with frequencies equal to their 
orbital frequencies, v= co/ln. They would thus lose energy continuously, i.e. their 
orbits are unstable and they would spiral into the nucleus. Their orbital frequencies 
would change continuously during this process. Therefore, the radiation emitted would 
include a continuous range of frequencies. 

In order to avoid this discrepancy with the laws of classical physics, Bohr 
formulated three postulates which describe the deviations from classical behavior for 
the electrons in an atom. These postulates proved to be an extremely important step 
towards quantum mechanics. They are: 
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- The classical equations of motion are valid for electrons in atoms. However, only 
certain discrete orbits with the energies are allowed. These are the energy levels 
of the atom. 

- The motion of the electrons in these quantised orbits is radiationless. An electron 
can be transferred from an orbit with lower (negative) binding energy (i.e. larger 
r) to an orbit with higher (negative) binding energy (smaller r), emitting radia- 
tion in the process. The frequency of the emitted radiation is given by 



En-En' = hv. ( 8 . 8 ) 

Light absorption is the reverse process. 

By comparing (8.8) and (8.3), Bohr identified the energy terms E^' and E^ as 



E„= - 



Rhc 



En'- 



Rhc 



(8.9) 



where the minus sign again implies that we are dealing with binding energies. 

- Finally, for the calculation of the Rydberg constant R in (8.9) from atomic quan- 
tities, Bohr used the comparison of the orbital frequencies of the electrons with the 
frequency of the emitted or absorbed radiation. In classical physics, these fre- 
quencies would be equal, as mentioned above. However, using (8.4), one can easily 
calculate that this is not at all the case in the hydrogen atom for small orbital radii r. 
Bohr*s decisive idea was then to postulate that with increasing orbital radius r, the 
laws of quantum atomic physics become identical with those of classical physics. The 
application of this “Correspondence Principle” to the hydrogen atom allows the deter- 
mination of the discrete stable orbits. 

We consider the emission of light according to the first two postulates for a transi- 
tion between neighboring orbits, i.e. for («-«') = 1, and for large n. From (8.3) we 
have for the frequency v, with n-n' = r 



V = Rc 



— ]=Rc 
n 



{n-r) 



( 8 . 10 ) 




1 

(1 - r/nf 




or, with T = 1 , 



IRc 



( 8 . 11 ) 



This frequency is now set equal to the classical orbital frequency a>/2n in (8.7), setting 
(8.7) equal to (8.9) and inserting in (8.11); this yields an equation from which R can 
be calculated: 



Rhc 1 



e^^rrio 



InlRc 



2n 1/3 



m^e^ 

Selh^c 



( 8 . 12 ) 
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From (8.12), we find for the Rydberg constant R (which we denote by R^o for reasons 
which will become apparent below) the numerical value 

^oo = (109737.318±0.012)cm"’ . (8.13) 



This may be compared with the empirical value in (8.2). In Bohr’s model, R is just the 
ionisation energy of the ground state of the atom, « = 1 . 

From (8.12), with (8.7) and (8.9), we find the radius r„ of the nth orbital to be 



n^h^Aneo 

e^rriQ 



(8.14) 



The quantum number n which occurs in these expressions is called the principal quan- 
tum number. 

In addition, we may calculate the orbital angular momentum l = r xp of an 
electron having velocity v„ and orbital frequency a»„ in the orbit with radius r„ and find, 
using (8.11) and (8.14), the quantisation rule 

\l\= mov„r„ = morla>„ = nh with n = l,2, 3,.... (8.15) 

This quantisation rule is often (but incorrectly) taken to be one of Bohr’s postulates. 

The essential common feature of the Bohr postulates is that they make no state- 
ments about processes, but only about states. The classical orbital concept is aban- 
doned. The electron’s behaviour as a function of time is not investigated, but only its 
stationary initial and final states. Figure 8.5 illustrates the model. 

Whether spectral lines are observable, either in emission or in absorption, depends 
on the occupation of the energy terms (also referred to as energy states). Absorption 
from a state presupposes that this state is occupied by an electron. In emission transi- 
tions, an electron falls from a higher state into an unoccupied lower one; the electron 
must be previously raised to the higher state by an excitation process, i.e. by an input of 
energy. At normal temperatures only the Lyman series in hydrogen is observable in 
absorption, since then only the lowest energy term (« = 1 in Fig. 8.4) is occupied. When 
the Balmer lines are observed in the spectra of stars as Fraunhofer lines (that is, these 
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lines are missing in the continuous spectrum because of absorption of light on the way 
through the stellar atmosphere), then it can be concluded that the temperature of the 
atmosphere is so high, that the first excited state of the H atom {n = 2) is also occupied. 
This is the basis of spectroscopic temperature determination utilising the Boltzmann 
distribution (2.8). For example, in the sun, with a surface temperature of 6000 K, only 
10“^ of the hydrogen atoms in the solar atmosphere are in the n = l state. 



8.4 Some Quantitative Conclusions 



We will now treat the Bohr model with arbitrary nuclear charge for hydrogen-like 
systems such as He"^, Li^'^, etc. quantitatively. The nucleus with charge Z is orbited by 
an electron in a circular orbit n at a distance and with the velocity There is an 
equilibrium between the Coulomb force and the centrifugal force: 



Ze^ 



r„ 



= mor„o}l. 



(8.16) 



where = v„/r„ is the circular frequency of the electron in its orbit n and is its 
mass. For the possible orbital radii follows, see (8.14) 



n^h^4neo 

Ze^mo 



(8.17) 



With Z = 1, n = i we find for the smallest orbital radius in the hydrogen atom 
ri(H) = 0.529 A, the right order of magnitude for the spatial extension of the neutral 
hydrogen atom. a*i(H) is referred to as the Bohr radius of the hydrogen atom in the 
ground state, abbreviated Qq. 

For the possible circular frequencies of the electronic motion we obtain 



1 Z^e^rriQ 



(8.18) 



For Z = 1 , « = 1 , the largest possible circular frequency is seen to be 
coi(H) = 10^®Hz; 



a>n would be the “classical” frequency of the emitted light if the electron behaved like a 
classical dipole in the atom. This is, however, not the case, see Sect. 8.3. The emitted 
frequency corresponds to the difference of the energy states of two orbits n and n' 
according to (8.9). The total energy is according to (8.6) 

= (8.19) 

47T£o'*n 

Substituting for r„ from (8.17) and v^, which can be obtained from (8.15), yields the 
possible energy states: 
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_ Z^e'^mo 1 

" nnhlh^ ■ ■ 

For Z = 1 , n = 1 we find the lowest energy state of the hydrogen atom: 



( 8 . 20 ) 



£'i(H)= - 13.59 eV. 

This is the ionisation energy of the H atom. 

For arbitrary Z, n = 1, one obtains 

£'i(Z)= - Z^- 13.59 eV. 

For the wavenumbers of the spectral lines we find, according to (8.3) and (8.9) 



V = — (£„-£„.) 

he 



e^rtioZ^ / 1 
647T^£o^^C 



1 



( 8 . 21 ) 



Comparison of this result with the empirically found Balmer formula (see Sect. 8.2) 
shows complete agreement with respect to the quantum numbers n and The 
quantum number n which was introduced by Bohr is thus identical with the index n of 
the Balmer formula. 



8.5 Motion of the Nucleus 

The spectroscopically measured quantity (Sect. 8.2) does not agree exactly with the 
theoretical quantity i?oo (8.13). The difference is about 60 cm“*. The reason for this is 
the motion of the nucleus during the revolution of the electron, which was neglected in 
the above model calculation. This calculation was made on the basis of an infinitely 
massive nucleus; we must now take the finite mass of the nucleus into account. 

In mechanics it can be shown that the motion of two particles, of masses mi and m 2 
and at distance r from one another, takes place around the common centre of gravity. 
If the centre of gravity is at rest, the total energy of both particles is that of a fictitious 
particle which orbits about the centre of gravity at a distance r and has the mass 



mi m2 
mi + mi 



( 8 . 22 ) 



referred to as the reduced mass. In all calculations of Sect. 8.4 we must therefore 
replace the mass of the orbiting electron, mo, by n and obtain, in agreement with 
experiment, 

R = R^ ^ . (8.23) 

1 + m(j/M 

Here mo = mi, the mass of the orbiting electron, and M=m 2 , the mass of the 
nucleus. The energy corrections due to motion of the nucleus decrease rapidly with in- 
creasing nuclear mass (Table 8.4). 
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Table 8.4. Energy correction for motion of the nucleus for the Rydberg numbers of several one-electron 
atoms 



Atom 


H('H) 


D(^H) 


T(^H) 


He+ 


Li2 + 


A 


1 


2 


3 


4 


7 


E 


5.45 


2.75 


1.82 


1.36 


0.78 


% 

E 


0.0545 


0.0275 


0.0182 


0.0136 


0.0078 



This observation makes possible a spectroscopic determination of the mass ratio 
M/mo, e.g. 

^^proton /^electron ~ 1836.15 . 

Due to the motion of the nucleus, different isotopes of the same element have 
slightly different spectral lines. This so-called isotope displacement led to the discovery 
of heavy hydrogen with the mass number A = 2 (deuterium). It was found that each 
line in the spectrum of hydrogen was actually double. The intensity of the second line 
of each pair was proportional to the content of deuterium. Figure 8.6 shows the H^line 
with the accompanying D^at a distance of about 1 A in a 1 : 1 mixture of the two gases. 
The nucleus of deuterium contains a neutron in addition to the proton. There are easily 
measurable differences in the corresponding lines of the H and D Lyman series, namely 



R^ = R^ 109677.584 cm'^ (8.24) 

1 + mo/MH 



Rj) = R^ 109707.419 cm"^ (8.25) 

1 + mo/Mj) 

The difference in wavelengths AX for corresponding lines in the spectra of light and 
heavy hydrogen is: 



zl A — Ah ~ Ad — Ah 




(8.26) 






Fig. 8.6. P lines of the Balmer series in a mixture of equal parts hydrogen (^H) and 
deuterium (^H). One sees the isotope effect, which is explained by motion of the nucleus. 
The lines are about 1 A apart and have the same intensity here, because the two isotopes 
are present in equal amounts [from K. H. Hellwege: EinfUhrung in die Physik derAtome, 
Heidelberger Taschenbucher, Vol. 2, 4th ed. (Springer, Berlin, Heidelberg, New York 
1974) Fig. 40a] 
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Table 8.5 gives the measured values. The agreement between the calculated and 
measured values is excellent. 

Historical remark: a difference of about 0.02% had been found between the values 
of the molecular weight of hydrogen determined chemically and by mass spectroscopy, 
because D is present in the natural isotopic mixture of hydrogen. Its mass was included 
in the results obtained by chemical means, but not by mass spectroscopy. In 1931, 
however, Urey discovered spectral lines which, according to their Rydberg number, 
belonged to D by observing a gas discharge through the vapour of 3 litres of liquid 
hydrogen evaporated into a 1 cm^ volume (Fig. 8.6). 



Table 8.5. Comparison of the wavelengths of corresponding spectral lines in hydrogen and deuterium. The 
lines belong to the Lyman series 



Aj)/ A 


Ar/ ^ 


1215.31 


1215.66 


1025.42 


1025.72 


972.25 


972.53 



8.6 Spectra of Hydrogen-like Atoms 

According to Bohr, the spectra of all atoms or ions with only one electron (one-electron 
systems) should be the same except for the factor and the Rydberg number. The 
spectrum of hydrogen should thus explain those of the ions He^, Li^"^, Be^"^ or any 
other ions which have only one electron. This has been completely verified experi- 
mentally (see Table 8.6 and the energy diagram in Fig. 8.7). 

For He'*', astronomers found the Fowler series 

= (8.27) 



Table 8.6. Wavelengths Aj 2 of the first Lyman lines, i.e. the spectral lines with n' = n = 2, of hydrogen and 
hydrogen-like atomic ions. The mass correction {first column) is used to calculate the Rydberg number 
{second column) and thus Aj 2 {third column). The calculated values are in good agreement with the measured 
values {fourth column) 





1+ 

"^nucl 


D 

^nucl 

[cm“‘] 


hi 

(calc) 

[A] 


hi 

(meas) 

[A] 


'H 


1.00054447 


109677.6 


1215.66 


1215.66 




1.00027148 


109707.4 


1215.33 


1215.33 


“He-^ 


1.00013704 


109722.3 


303.8 


303.6 




1.00007817 


109728.7 


135.0 


135.0 




1.00006086 


109730.6 


75.9 


75.9 


10g4 + 


1.00005477 


109731.3 


48.6 


48.6 


11b4 + 


1 .00004982 


109731.8 


12c5 + 


1.00004571 


109732.3 


^ 33.7 


^ 33.7 
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Z=1 Z=2 Z=3 

H He^ Li 




Table 8.7. Comparison of the spec- 
tral lines of the Balmer series in hy- 
drogen and the Pickering series in the 
helium ion, in A 



He+ 


H 


6560.1 

5411.6 


6562.8 (H„) 


4859.3 

4561.6 


4861.3 (H^) 


4338.7 

4199.9 


4340.5 (Hy) 


4100.0 


4101.7 (H^) 



Fig. 8.7. Some energy levels of the atoms H, He^ 
and Li^^ 



and the Pickering series 



Vp = 4J?He 



1 

7 




(8.28) 



which can also be represented as 



Vp - /?He 



1 

2^ 



1 

{n/2f 



n = 5,6 



(8.29) 



Every other line of the Pickering series thus almost corresponds to one of the Balmer 
lines of H. This is shown in Table 8.7. 

Later other He"^ series were found, such as the 



1st Lyman series 



Vli - 4/? He . 2 

iz 




(8.30) 



2nd Lyman series 



Vl 2 - 4i?He ( ^ 



(8.31) 



For Li^*^, Be^"^ and still heavier highly ionised atoms, spectral lines have been 
observed which can be calculated by multiplying the frequencies of the lines of the 
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H atom by and insertion of the corresponding Rydberg constant. With increasing 
nuclear charge Z, we quickly reach the region of x-ray wavelengths. 

In 1916, the collected spectroscopic experience concerning the hydrogen-similarity 
of these spectra was generalised in the displacement theorem of Sommerfeld and 
Kossel, which states: 

The spectrum of any atom is very similar to the spectrum of the singly charged 
positive ion which follows it in the periodic table. 

Hydrogen-like heavy atoms, i.e. heavy atoms from which all the electrons except 
one have been removed, can be prepared by accelerating the singly-ionised atoms to 
high energies and passing them through a thin foil; their electrons are “stripped off” 
on passing through the foil. For example, in order to strip all the electrons from a 
uranium atom and produce ions, they must be accelerated to energies greater 
than lOGeV. By permitting the ions to recapture one electron each, one can 
then obtain the hydrogen-like ion . The corresponding spectral lines are emitted 
as the captured electron makes transitions from orbits of high n to lower orbits. For 
the Lyman series has been observed in the spectral region around 100 keV and 
the Balmer series is in the region between 15 and 35 keV (Th. Stohlker, Phys. Bl. 52, 
42 (1996)). 



8.7 Muonic Atoms 

With the simple Bohr model, the muonic atoms, first observed in 1952, can be 
described. They contain, instead of an electron, the 207-times heavier ju meson or muon 
and are, in contrast to the Rydberg atoms, extremely small, in extreme cases hardly 
larger than the typical diameter of an atomic nucleus. 

To produce them, matter is bombarded with energetic protons (about 440 MeV), 
giving rise to other elementary particles, the pions, according to the following reaction 
schemes: 

p-\-n^n-^n-\-7i^ or p + n-^p+p+n~ . 

Here p denotes the proton, n the neutron, and n the pion. 

Pions have a charge or -e and mass m^=21?> m^. They decay into other 
particles, the muons, according to the reactions 

Here, the symbols v or v mean a neutrino or an antineutrino, the index p means muon 
neutrino (neutretto), and electron neutrinos carry the index e to distinguish them. The 
neutrinos are only shown for completeness. 

The half-life for this decay is 7]/2 = 2.5 • 10"^s. Muons may be characterised as 
heavy electrons; they have a charge e, a mass equal to 206.8 mo, and a half-life 
ri/2 = 2.2* 10"S. 

Muons decay into electrons (e“) or into positrons (e^) according to the reactions 
^e^ p~ ^e~ + Ve+Vu- 

Before they decay, they can be captured into outer atomic orbits by atomic nuclei and 
can occupy these orbits in the place of electrons. In making transitions from the outer 
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to inner orbits, the muons radiate light of the corresponding atomic transition fre- 
quency; this is light in the x-ray region of the spectrum. Since muons behave like heavy 
electrons, we can simply apply the results of the Bohr model. For the orbital radii we 
have, see (8.17) 



r 



n ~ 



Aneoh^ 2 
2 " 



(8.32) 



/■„ is thus smaller than the radius of the corresponding orbit which is occupied by an 
electron by the ratio of the electron to the muon mass. 

A numerical example; for the magnesium atom *^Mg we find 



Electron: /•,(e ) =-^^A = 4.5 • 10 '^m, 

12 

Muon: n(pi~) = ^ = 2.2 • 10~*'*m . 

207 

The muon is thus much closer to the nucleus than the electron. For the radiation 
from a transition between the levels with principal quantum numbers 1 and 2 the 
following expression holds: 



/ 1 1 



(8.33) 



that is, the quantum energy is larger by the ratio of the masses than the energy of the 
corresponding transition in an electronic atom. Finally, the muon decays as described 
above, or else it is captured by the nucleus, which then may itself decay. 

Muonic atoms are observed for the most part by means of the x-radiation which 
they emit; this radiation decays in intensity with the half-life characteristic of muons. 
Muonic atoms are interesting objects of nuclear physics research. Since the muons 
approach the nucleus very closely, much more so than the electrons in an electronic 
atom, they can be used to study details of the nuclear charge density distribution, the 




Fig. 8.8. The probability W of finding a 
muon at a distance r from the center of the 
nucleus of a muonic atom, in arbitrary units, 
for various shells; also showing the nuclear 
charge distribution {shaded area). The plot 
is for the lead nucleus, with Z = 82 and 
nuclear radius R = 6.687* 10" m (fm). The 
symbols used to indicate the shells will be ex- 
plained later. (After E. Boric and G.A. 
Rinker: Rev. Mod. Phys. 54, 68 (1982)) 
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Fig. 8.9. Muonic terms for an atom with Z = 60. The fully drawn levels correspond to the assumption of a point nucleus; the dashed levels take 
account of the finite nuclear size. The notation used for the transitions corresponds to that used for x-ray lines (Chap. 18). Note the energy 
scale 



distribution of the nuclear magnetic moment within the nuclear volume and of nuclear 
quadrupole deformation. 

Figure 8.8 shows the spatial distribution of a muon in several orbits of a lead atom. 
It can be seen that the muons in these orbits spend a considerable amount of time in the 
nucleus or in its immediate neighbourhood. Since the muons approach the nuclear 
charge Ze very closely, the binding and excitation energies become extremely large. 

Figure 8.9 shows a term diagram of the muonic-atom levels for a nuclear charge 
number Z = 60. The analogy with the hydrogen atom is evident; however, the transi- 
tions here are in the energy region of MeV, i.e. in the region of hard x-rays and of 
gamma rays. For the investigation of such muonic atoms, one therefore requires the 
tools of nuclear physics. Detection of the radiation is carried out with scintillator or 
solid-state detectors. 

Finally, Fig. 8.10 shows an example of the measurement of radiation from a 
muonic atom, the Lyman series in the muonic spectrum of titanium. The notations 
s,p,d, etc. in Figs. 8.8-10 refer to the orbital angular momentum of the electrons 
(muons). They will be further described in Sect. 8.9. 



8.8 Excitation of Quantum Jumps by Collisions 

Lenard investigated the ionisation of atoms as early as 1902 using electron collisions. 
For his measurements, he used an arrangement following the principle of the experi- 
mental scheme shown in Fig. 8.11. The free electrons produced by thermionic emission 
are accelerated by the positive grid voltage Vq and pass through the open-meshed grid 
into the experimental region. Between the grid and the plate A at the right of the 
drawing, which serves as the third electrode, a plate voltage is applied. The plate is 
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Gas at 




arrangement 




Fig. 8.11. Experimental arrangement for detecting ionisa- 
tion processes in gases. Only positive ions, which are form- 
ed by collisions with electrons, can reach the plate A. In the 
lower part of the figure, the plate current is plotted as a 
function of the grid voltage Vq. is the voltage with which 
the electrons must be accelerated in order to be able to 
ionise the atoms 



Gas at 




Fig. 8.12. Experimental arrangement of Franck and Hertz for investigating 
inelastic collisions between electrons and atoms. Electrons on the way from 
the grid to the anode can transfer their kinetic energies partially (particle 1) 
or completely (particle 2) to the gas atoms. The anode current as a func- 
tion of the grid voltage is plotted in the lower part of the figure. At high 
grid voltages, several energy-transfer processes can occur one after the 
other 



negatively charged relative to the grid. The voltages are chosen so that the electrons 
cannot reach the plate; they pass through the grid and are repelled back to it. When an 
electron has ionised an atom of the gas in the experimental region, however, the ion is 
accelerated towards the plate A. Ionisation events are thus detected as a current to the 
plate. 

The current is plotted as a function of the grid voltage Vq in the lower part of 
Fig. 8.11. Only when the electrons have a certain minimum energy e does the current 
appear. The corresponding accelerating potential k- is the ionisation potential of the 
atoms. 

Franck and Hertz showed for the first time in 1913 that the existence of discrete 
energy levels in atoms can be demonstrated with the help of electron collision processes 
independently of optical-spectroscopic results. Inelastic collisions of electrons with 
atoms can result in the transfer of amounts of energy to the atoms which are smaller 
than the ionisation energy and serve to excite the atoms without ionising them. 

The experimental setup is shown schematically in Fig. 8.12. Electrons from a heated 
cathode are accelerated by a variable voltage Vq applied to a grid. They pass through 
the grid and are carried by their momenta across a space filled with Hg vapour to an 
anode A. Between the anode and the grid is a braking voltage of about 0.5 V. Electrons 
which have lost most of their kinetic energy in inelastic collisions in the gas-filled space 
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can no longer move against this braking potential and fall back to the grid. The anode 
current is then measured as a function of the grid voltage Vq at a constant braking po- 
tential Kb- 

The result is shown in the lower part of Fig. 8.12. As soon as Vq is greater than V^, 
the current increases with increasing voltage (space-charge conduction law). At a value 
of Kg = 5 V (in mercury vapour) the current I is strongly reduced; it then increases 
again up to Vq = 10 V, where the oscillation is repeated. The explanation of these 
results is found by making the following assumptions: when the electrons have 





Typical measurennent 



Fig. 8.13. Improved experimental setup for determining 
atomic excitation energies by electron collisions. The colli- 
sions take place in the field-free space between the two grids 
G. In this way, a high resolution is reached. In the lower 
part of the figure, an experimental result obtained with Hg 
vapour is shown in part 



reached an energy of about 5 eV, they can give up their energy to a discrete level of the 
mercury atoms. They have then lost their energy and can no longer move against the 
braking potential. If their energy is 10 eV, this energy transfer can occur twice, etc. 
Indeed, one finds an intense line in emission and absorption dit E = 4.85 eV in the 
optical spectrum of atomic mercury, corresponding to a wavelength of 2537 A. This 
line was also observed by Franck and Hertz in the optical emission spectrum of Hg 
vapour after excitation by electron collisions. The excitation or resonance voltages are 
denoted in Figs. 8.12, 13 as 

The resolving power for the energy loss of the electrons may be improved by using 
an indirectly heated cathode and a field-free collision region. In this way, one obtains a 
better uniformity of the energies of the electrons. With an improved experimental 
arrangement (Fig. 8.13), a number of structures can be seen in the current-voltage 
curve; these correspond to further excitations of the atoms. The step at 6.73 eV, for 
example, corresponds to a further intense line in the Hg spectrum; 6.73 eV ^ 1850 A. 

Not all the maxima in the current-voltage curve can be correlated with observed 
spectral lines. To explain this fact, we have to assume that optically “forbidden” transi- 
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0.33 citT’—^ 



Fig. 8.14. line of the 
Balmer series at a high spec- 
tral resolution. Three com- 
ponents can be distinguish- 
ed. A still higher resolution 
is shown in Fig. 12.24. The 
resolution reached in this 
spectrum is limited by Dop- 
pler broadening 



tions can, in some cases, be excited by collisions. We shall see later that there are selec- 
tion rules for optical transitions between energy terms of atoms, according to which not 
all combinations of terms are possible - one says “allowed”. The selection rules for 
collision excitation of atoms are clearly not identical with those for optical excitation 
(or de-excitation). 

In this connection, the following experiment is interesting: Na vapour at low pres- 
sure can be excited to fluorescence by illumination with the yellow Na line (quantum 
energy 2.11 eV). The excitation occurs only when the light used for illumination has 
exactly the quantum energy 2.11 eV. Both smaller and larger quantum energies are in- 
effective in producing an excitation. 

Excitation by means of collisions with electrons are in this respect quite different: in 
this type of excitation, the yellow line is emitted whenever the energy of the electrons is 
equal to or larger than 2.11 eV. This can be explained as follows: the kinetic energy of 
free electrons is not quantised. After excitation of a discrete atomic energy level by 
electron collision, the exciting electron can retain an arbitrary amount of energy, 
depending on its initial value. This remaining energy can, if it is sufficiently large, serve 
to excite still other atoms in the gas volume. 

All in all, these electron collision experiments prove the existence of discrete excita- 
tion states in atoms and thus offer an excellent confirmation of the basic assumptions 
of the Bohr theory. In modern atomic and solid state physics, energy-loss spectra of 
electrons represent an important aid to the investigation of possible excitation stages of 
atoms and of the structure of the surfaces of solids. 

8.9 Sommerfeld’s Extension of the Bohr Model 

and the Experimental Justification of a Second Quantum Number 

The finished picture of the Bohr model still contained some fuzzy details: exact spectral 
measurements at high resolution showed that the lines of the Balmer series in hydrogen 
are, in fact, not single lines. Each of them consists rather of several components; how 
many one can distinguish depends on the resolution of the spectrometer employed. 

The line of hydrogen with v = 15233 cm“^ consists, for example of a multiplet 
with a wavenumber splitting of zlv = 0.33cm“^ between the strongest components 
(Fig. 8.14). In order to observe this structure, a spectral resolution of nearly 
v/Av= 100000 is needed. In the spectrum of the one-electron ion He'^, these multiplet 
lines are more strongly separated, and the splitting is therefore easier to observe. 
We shall see in Chap. 12 that the splitting increases as the 4th power of the nuclear 
charge number Z. 

From observations of this type, Sommerfeld derived an extension of the Bohr 
model. It is well known from classical mechanics that, according to Kepler’s Laws, not 
only circular orbits, but also elliptical orbits are possible, having the same energies. 

From this, Sommerfeld drew the conclusion that the same is true in atoms also. In 
order to distinguish the elliptical orbits from the circular ones, a new, second quantum 
number is required. Since Sommerfeld"^ chain of reasoning was on the one hand of 
great historical importance in introducing a second quantum number, but has, on the 
other hand, been made obsolete by the later quantum mechanical treatment, we will 
only give a brief summary here. 

The principal quantum number n remains valid; it continues to determine the total 
energy of a term according to (8.20), i.e. 
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E„= - 



RhcZ^ 



(8.34) 



According to Sommerfeld, n also determines the major semiaxis of the ellipse. The 
minor semiaxis is determined by the second quantum number in such a fashion that the 
absolute value of the angular momentum remains a whole multiple k of h, with k ^n. 
The length of the minor semiaxis, that is the eccentricity of the ellipse, has in this model 
no influence on the total energy. Each principal quantum number n corresponds to one 
major semiaxis but to various orbital shapes, characterised by the minor semiaxis 
bn,k- We say that the energy term is Az-fold degenerate, by which is meant that different 
orbits with two quantum numbers n and k belong to one and the same energy value. 

We should mention at this point that in quantum theory, the Sommerfeld second 
quantum number k became the orbital angular momentum quantum number /(/ = 
A: - 1). The orbital angular momentum of the electron is given by (as we shall show in 
Chap. 10) 

\l\ = \/l(,l+\)h with / = 0 , 1 , 2,...«-1 . (8.35) 

In order to distinguish the orbital angular momentum itself, I, from its quantum 
number /, we shall henceforth use the symbol |/ 1 for the absolute value of the angular 
momentum vector 1. 

For the various numerical values of the angular momentum quantum number, letter 
symbols 5, p, d, /, g, h, etc. have become firmly established; these are listed in the 
following table: 



Quantum number 


1 = 0 


1 


2 


3 


4 


5 


Angular momentum 


\l\ = 0 


V^h 




V^h 


]/2!0h 


ol 


Name (Symbol) 


s 


P 


d 


f 


g 


/^-electron or state. 


What this means in 


terms of the 


spatial 


form of the electron orbitals will be 



explained later, together with the solution of the Schrodinger equation (Chap. 10). 



8.10 Lifting of Orbital Degeneracy by the Relativistic Mass Change 

We still have no explanation for the doublet or multiplet structure of the spectral lines 
mentioned at the beginning of the last section. However, we now know that each level 
is «-fold degenerate; by this we mean the fact that each energy level has various pos- 
sibilities for the spatial distribution of the electrons occupying it. The number of levels 
with differing energies, and thus the number of observable spectral lines, however still 
remains the same. 

The lifting of this degeneracy occurs, according to Sommerfeld (1916), through the 
effect of the relativistic mass change, m = m(v), which we have neglected up to now. 
We can understand this qualitatively as follows: exactly as in planetary motion accord- 
ing to Kepler’s Laws, the electrons are accelerated when they come near to the nucleus. 
This is a result of Kepler’s Law of Areas, which requires that the moving electron sweep 
out equal areas between its orbit and the nucleus in equal times. In the neighbourhood 
of the nucleus, the electrons are thus faster and, from special relativity, more massive. 
This leads, in turn, to a decrease in energy: increased mass means, according to Bohr, a 
smaller radius, and this leads to a larger (negative) binding energy, i.e. to a decrease in 
total energy. The smaller the minor semiaxis of an ellipse, the more significant these 
relativistic corrections must become. 
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Fig. 8.15. Rotation of the perihelion point in the motion of an 
electron around the nucleus in a many-electron atom according 
to the Sommerfeld theory. The shaded region is the electronic 
shell of the atom. The outer electron follows a so-called “diving 
orbit” in its motion, i.e., it dives into the atomic shell. This 
model provides an intuitive explanation of the lifting of orbital 
degeneracy due to the relativistic mass effect (Sect. 8.10) and to 
the shielding of the nuclear charge (see Sect. 11.2) 



We will not repeat Sommerfeld'^ calculation here; we just give the result. The rela- 
tivistic mass change leads to a rotation of the perihelion point of the orbits; in an intui- 
tive picture, the electron then has a “rosette motion” about the nucleus (Fig. 8.15). 

In Sommerfeld'^ calculation, the “fine structure constant” plays a role: 

Velocity of the electron in the 1st Bohr orbit 

a = 

Velocity of light 

= — = (dimensionless) . 

Isohc 137 



For an electron orbit with the quantum numbers n and k, the result of 
Sommerfeld's calculation of the relativistic mass effect is 



£■„,*= -Rhc- 



1 + 






r n 3 \ j 

— - — + higher-order corrections 
Kk 4 



(8.36) 



The relativistic energy change is thus of the order of = 10”^, i.e. small, but observ- 
able (see Fig. 8.14). Applying the models developed by Sommerfeld, the structures of 
the hydrogen atom mentioned thus far can be described both qualitatively and quan- 
titatively. However, further experiments, which we shall describe in Chap. 12, pointed 
out the limits of these models. An adequate description of the relativistic motion of 
the electron is provided by the Dirac equation (cf. Sect. 14.6). 



8.11 Limits of the Bohr-Sommerfeld Theory. 

The Correspondence Principle 

The Bohr-Sommerfeld model is theoretically unsatisfying: on the one hand, classical 
mechanics is set aside, and only certain particular orbits are allowed; on the other 
hand, classical physics is used to calculate the orbits, see Sect. 8.3. It is as though, “On 
Mondays, Wednesdays and Fridays one uses the classical laws, on Tuesdays, Thurs- 
days, and Saturdays the laws of quantum physics” (Bragg). Furthermore, the model 
predicts only the frequencies but not the intensities or the time dependence of emitted 
or absorbed light. 

The gap which had opened between classical physics and the (early) quantum theory 
was bridged by Bohr with his Correspondence Principle. 

According to this principle, for large quantum numbers, the classical and quantum 
theories approach one another; or, the behaviour of an atom approaches that expected 
from classical, macroscopic physics, the larger its energy relative to the energy change 
which occurs in the process considered, i.e. all the more, the higher the level and the 
smaller the level difference. 
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Starting from considerations such as the above, one arrives at the following general 
formulation of the Correspondence Principle: 

Every non-classical theory must, in the limit of high energies and small energy 

changes, yield the results of classical theory. 

The intensities, polarisations, and selection rules for spectral lines may be 
calculated from the laws of classical physics. The Correspondence Principle allows us, 
within limits, to translate these results, by using a prescription for quantisation, into 
the quantum theory. 

In spite of a series of successes, the application of the Bohr-Sommerfeld theory led 
to fundamental difficulties. The results were wrong even for atoms with two electrons. 
The magnetic properties of atoms were not correctly described. The removal of these 
difficulties was accomplished with the development of modern quantum mechanics. In 
Chap. 10, we will treat the hydrogen atom problem exactly with the help of quantum 
theory; we shall find there that some of the results of the Bohr-Sommerfeld theory 
remain valid, while others must be modified. 



8.12 Rydberg Atoms 



Atoms in which an electron has been excited to an unusually high energy level illustrate 
well the logical continuity between the world of classical physics and quantum 
mechanics. 

Such atoms, called Rydberg atoms, have extraordinary properties. They are 
gigantic: Rydberg atoms are known with diameters reaching 10“^ mm, corresponding 
to a 100000-fold increase over the diameters of atoms in the ground state. Further- 
more, these excited states have extremely long lifetimes. While typical lifetimes of 
lower excited states of atoms are about 10 “®s, there are Rydberg atoms which have 
lifetimes of 1 s. The difference in energy between two neighboring states n and n' 
becomes very small when n is large. The long lifetimes of such states are in part a result 
of the fact that the probability of a spontaneous transition between two states n and n' 
is, aceording to Einstein (Sect. 5.2.3), proportional to v^. In addition, Rydberg atoms 
may be strongly polarised hy relatively weak electric fields, or even completely ionised. 

When the outer electron of an qtom is excited into a very high energy level, it enters 
a spatially extended orbit - an orbital - which is far outside the orbitals of all the 
other electrons. The excited electron then “sees” an atomic core, consisting of the 
nucleus and all the inner electrons, which has a charge -t- e, just the same as the charge 
of the hydrogen nucleus. As long as the excited electron does not approach the core too 
closely, it behaves as though it belonged to a hydrogen atom. Rydberg atoms behave 
therefore in many respects like highly excited hydrogen atoms. 

In interstellar space, there are atoms whose outer electrons are in states with principal 
quantum numbers n up to 350; this has been observed by radio astronomical methods. 
In the laboratory, Rydberg atoms with principal quantum numbers between 10 and 290 
have been studied. A recent example of still larger values of n is shown in Fig. 8.18. 

The orbital radius of an electron in an atom is proportional to n^ (8.17). The 
spacing between neighbouring energy levels decreases as n~^. It is because these higher 
powers of n have especially large effects for large n-values that Rydberg atoms have 
their unusual properties. 
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Fig. 8.16. Apparatus for the detection of Rydberg 
atoms. An atomic beam is crossed by several (here 2) 
laser beams. They cause the excitation of the atoms 
into Rydberg states when the sum of the quantum 
energies of the laser beams corresponds to the excita- 
tion energy of a Rydberg state. The Rydberg atoms 
are ionised in the electric field of a condenser, and the 
ions are then detected 



Fig. 8.17. An example of the detection of Rydberg 
states of the lithium atom with az = 28 to 39, measured 
with an apparatus like that shown in Fig. 8.16. The 
distance (in wave number units) to the series limit is 
plotted as the ordinate 




Rydberg atoms are produced by exciting an atomic beam with laser light. To detect 
the highly excited atoms, an electric field is applied between the plates of a condenser 
through which the atomic beam passes. Through field ionisation, the atoms can be con- 
verted to ions with the aid of small electric fields of the order of a few hundred V cm “ ^ 
The ions can be detected by means of their charge, for example with the aid of an 
electron multiplier or channeltron. An example of an experimental setup is shown in 
Fig. 8.16; Fig. 8.17 shows some experimental results. In Fig. 8.17, the result of exciting 
a beam of lithium atoms with three laser beams is shown. The first two excite the atoms 
into intermediate excited states (e.g. here n = 3, 1=0), while the third is continuously 
variable within a small energy range and adds the last necessary energy contribution to 
put the atoms into a Rydberg state. By continuously changing the frequency of this last 
laser, the experimenter can excite a series of Rydberg states of the atoms one after 
another - in the figure, the states with /? = 28 to 39. Thus, a particular Rydberg state 
can be chosen and selectively excited in order to investigate its physical properties. 

When a Rydberg atom reduces its principal quantum number by 1 in emitting a light 
quantum, the light is in the far infrared or microwave region of the electromagnetic 
spectrum. With this radiation, isolated Rydberg atoms were first discovered in 1965 in 
interstellar space. The density of atoms is so low there that collisions are extremely 
rare. 

It has been possible to investigate Rydberg atoms in the laboratory since narrow- 
band, tunable lasers have been available (especially dye lasers, see Chap. 21). Since 
then, the energy levels, lifetimes, spatial extension of the wavefunctions, and the in- 
fluence of electric and magnetic fields have been studied for quantum numbers which 
were previously only theoretical. The predictions of theory have been fully confirmed. 
Table 8.8 contains an overview of the properties of Rydberg atoms. 
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ENERGY /GHz 



Fig. 8.18. Rydberg excitation states of barium atoms with the principal 
quantum number n, observed using Doppler-free spectroscopy. The abcissa 
gives the distance from the series limit in units of GHz. From J. Neukammer 
et al.: Phys. Rev. Lett. 59, 2947 (1987) 



Table 8.8. Some properties of Rydberg atoms, valid for unperturbed electronic states 



Property 


General 


Rydberg atoms, 
^ = 30 


Size 


d = a 


10-’ A 


Binding energy 


8 

II 

1 


10“^ eV 


Transition energy 
An= 1 


AE = 2R/n'^ 


10--’ eV 10 cm-' 


Lifetime 


Tocn^ 


30-10-S 



8.13 Positronium, Muonium, and Antihydrogen 

It is possible to make artificial atoms in which one or both of the atomic components 
of hydrogen, the proton and the electron, are replaced by their corresponding antipar- 
ticles. The antiparticle of the proton is the antiproton, p \ that of the electron is the 
positron, As far as is currently understood, particles are distinguished from their 
antiparticles only through the opposite sign of their electric charges and magnetic 
moments, cf. Sect. 14.6. Therefore, all the conclusions of the Bohr model concerning 
atomic radii, energy levels, and transition frequencies derived in Sects. 8.4 and 8.5 
should also apply to atoms containing antiparticles. Here, we shall treat briefly the 
“exotic” atoms positronium, muonium, and antihydrogen. 

Positronium, an “atom” consisting of an electron, e~ , and a positron, e"" , was 
discovered in 1949 by M. Deutsch. It is formed when positrons and electrons enter a 
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Table 8.9. The reduced mass in units of the electron mass, the binding energy £’g, the energy spacing be- 
tween the « = 1 and n = 2 levels, and the first Bohr radius Qq for positronium and muonium in comparison 
to the H atom 







mo 


Eb 


1 

r 




Hydrogen 


p^e 




13.6eV 


10.2 eV 


0.53 A 


Positronium 


e^e~ 


0.5 


6.8 eV 


5.1 eV 


1.06 A 


Muonium 






13.6 eV 


10.2 eV 


0.53 A 



(short-lived) bound state e~), before they annihilate each other with the emission 
of two y-quanta. If the particles have no kinetic energy before their annihilation, each 
of the y-quanta has an energy equal to = 511 keV, where is the mass of the 
electron. The lifetime of socalled parapositronium, with overall spin S = 0 (see 
Sect. 17.3), is 1.25 • 10“ ^^s. Orthopositronium, with 5 = 1, is produced with a smaller 
probability and has a longer lifetime of 1.4- 10“^ s. It decays into 3 or more y-quanta. 

Positrons can be obtained from the radioactive decay of nuclei, e.g. of ^^Na, and 
are thus relatively readily available. Positronium atoms are formed when positrons 
pass through a gas or impinge on solid surfaces, where the positron can capture an elec- 
tron. During the brief lifetime of the atoms, their binding energies and excitation 
energies can be measured and the results of the Bohr model thus confirmed. 

According to (8.20) and (8.22), the energies of the levels should be proportional to 
the reduced mass, and therefore half as large as in the hydrogen atom. The orbital radii 
and the wavelengths of the emitted radiation should be twice as large as in hydrogen. 
Both effects are observed as predicted; cf. Table 8.9. 

In condensed-matter physics and in modern medicine, positronium atoms are used 
as probes for structures and dysfunctions, because the emission of their annihilation 
radiation, and thus their lifetimes, is dependent on their material surroundings. In 
medicine, positron emission tomography is used for example to form an image of 
diseased tissue in the brain. 

Muonium, e~), is so to speak the lightest muonic atom (cf. Sect. 8.7). It is 
formed in a similar way to positronium, when positive muons, jUL ^ , enter into a bound 
state with electrons on passing through a gas or on a solid surface. Like negative 
muons, jbi "" particles are unstable (see Sect. 8.7), and the lifetime of muonium is cor- 
respondingly only 2.2* 10"^ s. According to (8.20) and (8.22), its binding energy is 
13.5 eV, only slightly different from that of hydrogen, due to the nearly equal reduced 
masses. The orbital radii are obtained from (8.17), and the lowest optical excitation 
from the state with n = \ (15 state) to the state with n = 2 (25) is found from (8.21) 
to be 10.15 eV; cf. also Table 8.7. These atoms have been studied extensively by spec- 
troscopic methods, but we will not discuss the results further here. They are particular- 
ly relevant to the refinements of the Bohr model by Dirac’s relativistic quantum 
mechanics, which we will treat in Chap. 12. 

Particularly interesting is the antihydrogen atom, (pe^), which consists of a 
positron bound to a negatively-charged antiproton. According to the postulates of 
quantum mechanics, antimatter should behave just like ordinary matter. An experi- 
mental test has yet to be performed, since antimatter was not available until very 
recently. In 1995, the successful preparation of antihydrogen was reported for the first 
time. It was carried out as follows: 
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Antiprotons can be produced in accelerators having particle beams of sufficiently 
high energy, for example at CERN in Geneva. When they pass through the Coulomb 
field of an atomic nucleus (xenon gas was used), a portion of the kinetic energy of the 
antiprotons is converted into /e~ pairs. With a small probability, the slowed anti- 
proton p can capture a positron , giving rise to an atom of antihydrogen, (pe^). 
It is electrically neutral and therefore leaves the accelerator ring on a tangential orbit. 
Thus far, these antihydrogen atoms have been detected only via their decays: the 
is stripped off the atom when it passes through a Si semiconductor particle detec- 
tor. This positron annihilates with a negative electron, and the resulting annihilation 
radiation is detected and measured by a Nal scintillation counter. The remaining p is 
analysed with respect to its mass, charge, and velocity by additional detectors. In the 
first report (by the group of W. Oelert, in Phys. Lett. B (1996)), the detection of 8 anti- 
hydrogen atoms is described; they were produced by a beam of 10^^ antiprotons 
during a beam time of 15 hours. Their lifetime was about 40 ns. 

One goal of such efforts is the spectroscopic investigation of the antihydrogen 
atoms, as a test of the symmetry of the interactions between matter and antimatter. 
For this purpose, e.g. for the observation of the Balmer series of antihydrogen, the very 
few atoms as yet produced, which in addition have high kinetic energies and short 
lifetimes, are naturally not sufficient. 

Another experiment designed to produce antihydrogen is planned to yield the atoms 
in a state of rest, without kinetic energy, by using a combination ion trap for positive 
heavy and negative light particles (cf. Sect. 2.4.6) and trapping antiprotons and 
positrons in it at the same time. The group of Th, Hdnsch has already reported the 
simultaneous trapping of positive and negative particles in such a combined trap (Phys. 
Rev. Lett. 75, 3257 (1995)). The binding of trapped particles to antihydrogen has how- 
ever not yet been observed. 



Problems 

8.1 Calculate the recoil energy and velocity of a hydrogen atom in a transition from 
the state n = A io the state « = 1, in which a photon is emitted. 

8.2 Five of the Balmer series lines of hydrogen have the wavelengths 3669.42 A, 
3770.06 A, 3835.40 A, 3970.07 A and 4340.47 A. Plot v as a function of n for the 
Balmer series. From this, determine the value of n for the upper level of each of the five 
wavelengths above. 

8.3 The absorption spectrum of hydrogen can be obtained by allowing white light to 
pass through hydrogen gas which is in the ground state and contains atomic hydrogen 
(not just H 2 ). Which photon energies are observed in the hydrogen absorption spec- 
trum? Give the wavelengths of these “Fraunhofer lines”. 

8.4 a) The emission spectrum of the hydrogen atom is taken with a diffraction grating 
(line spacing d= 2 pm). A line of the Balmer series is observed in the second order at an 
angle 0=29°5'. What is the quantum number of the excited state from which the 
transition starts? 

b) What is the minimum number of lines necessary in a diffraction grating if the 
first 30 spectral lines of the Balmer series of the hydrogen atom are to be resolved in the 
first-order diffraction spectrum? 
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8. Bohr’s Model of the Hydrogen Atom 



Hint: In this case, the number of lines corresponds to the required resolution k/Ak, 

8.5 Is it true that in a circular Bohr orbit, the potential energy is equal to the kinetic 
energy? If not, where does the energy difference go which arises if we assume that the 
electron and the nucleus are initially infinitely far apart and at rest? How large is 
compared to for the various Bohr orbits? 

8.6 The attractive force between a neutron (mass M) and an electron (mass m) is 
given by F= GMm/r^. Let us now consider the smallest orbit which the electron can 
have around the neutron, according to Bohr’s theory. 

a) Write a formula for the centrifugal force which contains m, r and v\ r is the radius 
of the Bohr orbit, and v is the velocity of the electron in this orbit. 

b) Express the kinetic energy in terms of G, M, m and r. 

c) Express the potential energy in terms of G, A/, m and r. 

d) Express the total energy in terms of G, M, m and r. 

e) Set up an equation which corresponds to the Bohr postulate for the quantisation of 
the orbits. 

f) How large is the radius r of the orbit with n = V. Express r in terms of h, G, M and 
m\ give the numerical value of r. 

8.7 For the Bohr model of the atom, calculate the electric current and the magnetic 
dipole moment of the electron in the first three orbits (« = 1, 2, 3). 

Hint: Use (12.1 -7) to calculate the magnetic dipole moment. 

8.8 “Positronium” is a bound electron-positron pair. The positron is the anti-particle 
corresponding to the electron. It has a charge + e and the same rest mass as the elec- 
tron. On the assumption that e~ and - in analogy to the H atom - circle the com- 
mon centre of gravity, calculate the rotational frequency cu, the radius r and the 
binding energy of the system in the ground state. 

8.9 A muonic atom consists of an atomic nucleus with nuclear charge Z and a 
captured muon, which is in the ground state. The muon is a particle with a mass 207 
times that of the electron; its charge is the same as that of the electron. 

a) What is the binding energy of a muon which has been captured by a proton? 

b) What is the radius of the corresponding Bohr orbit with « = 1? 

c) Give the energy of the photon which is emitted when the muon goes from the state 
« = 2 to the ground state. 

8.10 Estimate the number of revolutions A^an electron makes around the nucleus in 
an excited hydrogen atom during the average lifetime of the excited state -10“^s-if 
a) it is in the state with « = 2, and b) in the state with n=\5, before it returns to the 
« = 1 state, c) Compare these numbers with the number of revolutions the earth has 
made around the sun in the 4.5 x 10^ years of its existence. 

8.11 In addition to the isotope "^He, natural helium contains a small amount of the 
isotope ^He. Calculate the differences in the wavenumbers and energies of the first and 
third lines of the Pickering series which result from these mass differences. The relative 
isotopic masses are: 

^He: 3.01603 u and ^He: 4.00260 u . 




Problems 
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8.12 Which lines of the hydrogen spectrum lie in the visible region of the spectrum 
(between 4000 A and 7000 A)? Which helium lines fall in the same region? How could 
one tell whether a helium sample has been contaminated with hydrogen? 

8.13 Estimate the relative relativistic correction AEn^i^/E^ for the n = 2 levels in the 
hydrogen atom. 

Hint: Compare (8.29). 

8.14 To excite the hydrogen atom into its Rydberg states, one uses the additive 
absorption of the light from two lasers. Let the first of these have a fixed emission 
wavelength A, which corresponds to 11.5 eV. What wavelengths must the second laser 
have in order to pump atoms into the state with n = 20, 30, 40 or 50? How large are the 
radii and binding energies for these states? What is the maximum possible linewidth for 
both lasers if only a single n state is to be populated? 

8.15 a) Calculate the frequency of the orbital motion of an electron in a hydrogen 
atom for a level with the quantum number n. 

b) Calculate the frequency of the radiation emitted in the transition from the state n 
to the state n-i, 

c) Show that the results of a) and b) agree if n is very large. 

8.16 Estimate the magnitude of the correction terms which must be applied to the 
energies of the stationary states of the lightest atoms, i.e. ^H, ^H, ^H, He^ and Li^"^, to 
account for the motion of the nucleus. 

8.17 If one did the Franck-Hertz experiment on atomic hydrogen vapour, which lines 
in the hydrogen spectrum would one see if the maximum energy of the electrons were 
12.5 eV? 

8.18 Four lines in the Balmer series of He"^ have the wavelengths 164.05 nm, 
121.52 nm, 108.45 nm, and 102.53 nm. Plot the wavenumbers v as a function of n. Is 
there a more reasonable way to plot these data? Find the value of n in the upper level 
for each of the wavelengths given. 

Hint: What is the meaning of the Balmer series? He^ is analogous to H. 

8.19 A Wannier exciton is a bound state between an electron and a hole in a solid. 
Apply the Bohr model to such an electron-hole pair, taking the effective masses and 
the dielectric constant of the semiconductor (surrounding medium) into account. 

a) What are the energies of the excited states with 2<«<5? 

b) What does the absorption spectrum of such an exciton book like? 

Hint: A hole is a missing electron, which ideally has the same properties as an electron 
except for its positive charge. As an example, consider CU 2 O, with £r^l0, reduced 
mass fjL-0.1 m^. Literature: C. Kittel: Introduction to Solid State Physics. 




9. The Mathematical Framework of Quantum Theory 



As we saw in the previous chapter, classical physics is unable to offer a satisfactory 
explanation of the structure of even the simplest atom, that of hydrogen. This was first 
achieved by quantum theory. We shall therefore go into the theory in more depth, 
beginning where Chap. 7 left off. We shall be particularly, but not exclusively, 
concerned with bound states, of which the simplest example is 



9.1 The Particle in a Box 

In order to become more familiar with the formalism of quantum theory, which will 
then lead to quantitative predictions, we first consider the one-dimensional motion of 
an enclosed particle. “Enclosed” means that it can only move in a “box” of length a. 
The probability of finding the particle outside the box is zero (Fig. 9.1). We shall now 
attempt to construct the appropriate wavefunction. We require that 

(// = 0 for X <0 , 

(9.1) 

if/ = 0 for x>a , 

because the particle cannot be outside the box. We further postulate that the wavefunc- 
tion y/{x) inside the box is continuous with the function outside, i.e. that 



l//(0) = 0 , y/(a) = 0 . 



(9.2) 




Fig. 9.1. Potential barriers for a particle in a box 



We are seeking wavefunctions which describe a particle in this box and simultaneously 
guarantee that the particle always has a certain definite energy. We recall the de Broglie 
waves 



A exp [i {kx — cot)]. 



(9.3) 




126 



9. The Mathematical Framework of Quantum Theory 



According to the basic laws of quantum theory, the wavenumber k and the frequency 
(o are related to the particle’s energy and momentum by the relations 

E=hco (9.4) 

and 

p = hk. (9.5) 

From the experiments described above, we know that we can use the relationship 
from classical physics. 




2 mo 



If we express p in terms of k, and solve for k, we obtain two possible values for k. 



M,2 ■ 



± — \/2moE , 
h 



(9.7) 



for the given value of total energy E. 

In addition to the wavefunction (9.3), the wavefunction 

AQX^{-'\kx-\Q)t) (9.8) 

yields the same energy. This will help us out of a difficulty. As one can see by substi- 
tuting X = 0 and x = a in (9.3), the wavefunction (9.3) does not satisfy the boundary 
conditions (9.2). One way out is the following: since electron waves display diffraction 
and interference, we may infer that we can superpose waves in quantum mechanics, as 
we did in fact with wave packets in Sect. 7.1. We therefore generate a new wavefunc- 
tion by superposing (9.3) and (9.8): 

y/{x,t) = (Cie'*^-f C 2 e“‘*^)e“'“", (9.9) 

where the constants Q and C 2 are still unknown. 

To abbreviate, we write (9.9) in the form 

V/(x, t) = 0(x)e“‘"' (9.9a) 

where 

0(x) = Cie‘*''-t-C 2 e“‘^^. (9.9b) 

In order to determine the constants Q and C 2 , we substitute (9.9) in the first equation 
(9.2) and obtain 



0(0) = 0: Ci-t-C2 = 0. 



(9.10) 




9.1 The Particle in a Box 
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Thus C 2 can be expressed in terms of Cj. (9.9) then takes the form 

<p(x) = Ci(e‘*^-e-‘*^) = 2iCi sinA:x; (9.11) 

here we have made use of the definition of the sine function. To fulfil the second 
condition of (9.2), we substitute (9.11) in (9.2) and obtain: 

because 0(a) = 0 ; the condition sin ka = 0 , (9.12) 

Since the sine can only be zero if its argument is a whole multiple of tt, we can only 
satisfy (9.12) by the choice of 

k = — , n = 1,2,3,4... . (9.13) 

a 

This result means that the only waves which will fit into the box have a half-wavelength 
equal to a whole fraction of the length of the box, a (Fig. 9.2). If we substitute (9.13) in 
the expression for kinetic energy (9.6), we obtain 

= ^ (9.14) 

2wo \ a / 

for the energy of the particle, with the condition that n^ \ must be an integer. The 
parameter n cannot be equal to zero, because otherwise the wavefunction would be 
identically equal to zero. In other words, there would be no particle. 

The result (9.14) is typical for quantum theory. The energies are no longer con- 
tinuous as in classical physics, but are quantised. In order to determine Ci in (9.11), 
which is still open, we remember that the wavefunction must be normalised. We thus 
have the condition J y/^y/dx = 1 to fulfil. If we substitute (9.11) in this, we first obtain 

j|0(jc)prfx= |Ci|^J(2-e^^^-e~^^^)rfx. (9.15) 

0 0 




Fig. 9.2. Potential barriers, energies and wavefunctions of the 
particle in the box. Two different parameters are plotted in the 
same figure. 1) The energies E 2 , E^ of the first 3 states are 
plotted along the E ( = energy) axis. (There is an infinite series of 
higher energies above these.) 2) The x axis is drawn to the right of 
each of the E values, and the wavefunction appropriate to each 
energy is shown on it. One should notice that the number of times 
the wavefunction crosses the x axis inside the box increases by 1 for 
each higher energy state 
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This integral is easy to evaluate and yields 



]\(t>(c)?dx=\C,\^-la. (9.16) 

0 

Because the integral (9.15) has to be equal to 1 to fulfil the normalisation condition, the 
normalisation constant C\ has the form 







(9.17) 



It should be remembered that Q can only be determined up to the constant phase 
factor exp(ia). As we shall see later, this type of phase factor has no physical meaning, 
because it disappears during the calculation of expectation values (see below). Our final 
result thus has the form 



0(x) = — =r-exp(ix«7r/a) - - - tx^{-\xmi/a ) , 

1 / 2 ^ 

or in another notation, 



(9.18) 



(p{x) = 



i sin(x- nn/a ) . 

a 



(9.19) 



As we have seen, the wavefunction (9.18) is associated with a definite energy. Does 
this also hold for the momentum? This is clearly not the case, because it describes both 
a wave with k = n n/a and a wave with k = -n n/a. If we should measure the momen- 
tum, we would thus find values p = hk and p = - hk with equal frequencies. In order 
to derive the probability of occurrence of a given momentum from the wavefunction, 
let us first consider the wavefunction 



V~a 



ex^iixnn/a) 



(9.20) 



which is obviously normalised in the region from 0 to a: 

l2 



0 



1 . nn ^ 

exp 1 X 

Va \ « y 



1 ^ 

dx = —\dx=\ 
a 0 



(9.21) 



When we measure the momentum, it means that we determine a particular value of 
k, i.e., we select one of the components of (9.18). This component is a factor of l/|/2 
smaller than the corresponding component of (9.20). On the other hand, we expect for 
symmetry reasons that both components occur with equal probability =1/2. To go 
from \/\/l to 1/2, of course, we square \/\/2. This observation can be generalised: The 
probability of measuring a given momentum k can be obtained by taking the square of 
the absolute value of the coefficient in front of the normalised plane wave. 

We leave it to the reader as an exercise to explain the relationship between the wave- 
function (9.18) and the momentum (9.5) using the Heisenberg uncertainty relation. 
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9.2 The Schrodinger Equation 

As we saw in the preceding example, there are for a given problem, in this case the 
particle in a box, infinitely many solutions, each with a corresponding energy level 
(9.14). In this case it was relatively easy to find these solutions, which is decidedly not 
the case for other quantum mechanical problems. In such cases it is often useful first to 
look for an equation which determines (//. In the case of the electron which is not sub- 
jected to any forces, we find it as follows: we ask if there is an equation for \j/ such that 
its solutions automatically fulfil the relation 

hcD = ^-—. (9.22) 

Into 



Since the parameters k and a> are found in the de Broglie wave Qxp{ikx-\cot), we can 
formulate the question thus: what must be done to obtain h^k^/lrriQ from exp(iA:x) 
and ha> from exp(-ico/), so that the relation 

(9.23) 

2 mo 



will be fulfilled? If we differentiate Qxp(\kx) twice with respect to x and multiply by 
- h^/2mo, we obtain the left side of (9.23) as a factor. Correspondingly, the right side 
of (9.23) is obtained if we differentiate exp( - icot) with respect to time and multiply by 
\h. In this way we obtain the basic Schrodinger equation for the force-free particle: 

"L^y/=ihy/. (9.24) 

2 mo dx 

It must be said, however, that it is generally not possible to derive the basic equa- 
tions of physics from still more fundamental principles. Instead, one must try to com- 
prehend the physics by heuristic thought processes, to arrive at an equation, and then 
to compare the possible solutions with experimentally testable facts. In this way it has 
been found that the Schrodinger equation is completely valid in nonrelativistic 
quantum mechanics. We generalise (9.24) to three dimensions by writing the kinetic 
energy in the form 

E = ^{pI+p]+pI). (9.25) 

2 mo 



It seems reasonable to generalise the wavefunction to 

exp(i/:^x + i/:^y + i/:^z) exp(-icu/) . (9.26) 

Instead of (9.23) we have the relation 

h^{kl + k], + k\) = ho). 

2 mo 



(9.27) 
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The left side of (9.27) is obtained from (9.26) by taking the second derivatives of (9.26) 
with respect to the position coordinates x, _y and z, adding these and multiplying the 
result by -h^/lrriQ. The right side of (9.27) results by differentiation of (9.26) with 
respect to time and multiplication by i h. We thus obtain the equation 



3^ 3^ \ ..3 

2mo\dx^ dy^ 3zV 9^ 



(9.28) 



The left side can be abbreviated by introducing the Laplace operator 






dx^ dy^ dz^ 



(9.29) 



which yields the usual form of the Schrodinger equation for the force-free particle in 
three dimensions, 



^ w2 9 

V w=\h — \i/ , 

IniQ dt 



(9.30) 



Now we are naturally not so interested in the force-free motion of the particle as in 
its motion in a force field. However, (9.30) gives us a hold on the subject. We see that 
the left side was derived from the expression p^/lrriQ for the kinetic energy by replacing 
it by a differentiation rule - (h^/2mo) V^. This rule acts on ^ and is called the kinetic 
energy operator. In the presence of a potential field, the total energy according to clas- 
sical mechanics is the sum of the kinetic and the potential energy: 

^pUV{r) = E. (9.31) 

Into 

We can arrive heuristically at the total energy operator of the quantum treatment by 
simply adding V to the kinetic energy operator. We thus obtain the time-dependent 
Schrodinger equation in the presence of a potential field: 



' h^ 0 

— V^+ V(r) 
2 mo 



w(r, t) 




t) 



(9.32) 



The expression 



- 



h Q 

V^+K(r) 
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(9.33) 



is called the Hamiltonian (operator). 

The beginner may not be used to working with operators. One can quickly become 
accustomed to them, if one remembers that they are only convenient abbreviations. 
One must also remember that such operators are always to be applied to functions. 

If the potential field on the left side of (9.32) does not depend on time, we can 
proceed from the time-dependent to the time-independent Schrodinger equation. In 
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doing so, just as in (9.9a), we separate a time factor exp( - i cut) from y/(r, t). In quan- 
tum mechanics it is customary to write E/h instead of cu, so that we write 

= . (9.34) 

Since the time differentiation only applies to on the right side of (9.32), we need here 
only to differentiate the exponential function with respect to time, which yields the 
factor E. If we then divide both sides of the corresponding equation by the exponential 
function, we obtain as the result the time-independent Schrodinger equation 

( -^V'+F(r)U(r) = £0(r). (9.35) 

V 2mo / 



As we saw in the preceding example, the wavefunction must ordinarily be subject to 
boundary conditions (9.2). If these are not specified, we apply the so-called natural 
boundary conditions, which require that (// vanishes at infinity, so that the wavefunc- 
tion can be normalised, i.e. 

\W?dV=\. (9.36) 

Before we proceed to the solution of the Schrodinger equation, we shall again take 
up the question of observations, measured values and operators. 



9.3 The Conceptual Basis of Quantum Theory 

9.3.1 Observations, Values of Measurements and Operators 

Determination and Probability of Position 

In the preceding sections, we saw that the explanation of microcosmic processes 
required new ways of thinking which are fundamentally different from the ideas of 
classical physics. In classical mechanics, the motion of a body, such as the fall of a 
stone or the flight of a rocket, can be precisely predicted by the laws of motion. 
According to these laws, the position and momentum of a body can be determined to as 
great a precision as is desired. 

The wavefunction is the new concept which is central to quantum physics. As the 
solution of the time-dependent Schrodinger equation, it describes the time evolution of 
physical processes in the microcosm. In this section we shall explore the physical impli- 
cations of the wavefunction, or in other words, which experimental results the theo- 
retical physicist can predict for the experimental physicist. The (conceptually) simplest 
experiment would be to determine the position of a particle. As we already know, the 
wave function y/ can only make a probabilistic prediction. The expression 

l^dxdydz (9.37) 

gives the probability that the particle will be found in a volume element dxdydz about 
the point x, y, z. We now ask whether the wavefunction can also predict the results of 
observations of momentum. 
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9.3.2 Momentum Measurement and Momentum Probability 

Let us first consider as an example the wavefunction of the particle in a box (Sect. 9.1), 
11 11 

0{x) = — — exp(i/:x)-— exp(-i^x). (9.38) 

\n \fa \/2 ’ 

I 1 I I 

WlW U2{x) 

The two underlined wavefunctions each satisfy the normalisation conditions (9.36). 
According to the basic rules of quantum mechanics, the momentum associated with the 
wavefunction u^{x) is given by hk, while the momentum of the second wavefunction 
Ui(x) is h{-k)= - hk. 

Both of these momenta are thus represented by the wavefunction (9.38). If we 
determine the momentum of the particle in the box described by the wavefunction 
(9.38), we expect to observe either + hk or - hk. However, we cannot predict which of 
the two momenta we will observe. If we imagine that the particle flies back and forth in 
the box, it is intuitively clear that we will observe the momenta hk and - hk with a 
probability of 1 /2 each. As we saw in Sect. 9. 1 , the squares of the absolute values of the 
coefficients Q and Ci give the probability of finding the corresponding momentum. 
We generalise this insight to the determination of the probability distribution of the 
momenta of a generalised wave packet. Here the particle is no longer confined in a box. 
This type of wave packet has the general form 



Wix) = 






— 00 



(9.39) 



In order to connect the coefficients with a probabilistic interpretation, we must 
be sure that the wavefunctions exp(iA:x) are normalised in infinite space. This is some- 
what difficult, and will not be demonstrated here (see Appendix A). We shall simply 
state the result. If we introduce the momentum variable p in the place of the integration 
variable k, and at the same time use the correct normalisation of the wavefunction in 
one dimension, we obtain 

Vix) = Jc{p)-^eP-'f>dp . (9 40) 



The underlined wavefunction is normalised. As a generalisation of our considerations 
above, we see \c(p) \^dp as the probability of observing momentum p in the interval p, 
..., p + dp. This result can be immediately expanded to three dimensions: if we rep- 
resent a wavefunction y/{x,y,z) as a superposition of normalised plane waves, 

+ oo 

W(x,y,z) = inc(/7j^,/7j,,/7^)(27rA)“^''^exp(i/7 • r/h)d^p , with 

— oo 

P • r = PxX+Pyy+p^z , then 

I ciPx,Py,Pz) ?dpx dpy dp^ 



(9.41) 
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is the probability that the components of the observed momentum of the particle p will 
lie in the intervals . •Px+dpx, Py . . .py + dpy, p ^. . .Pz+dp^. 



9.3.3 Average Values and Expectation Values 

To explain these concepts, we think again about the example of the dice. The individual 
possible “observed values” are the numbers of spots, 1, 2, . . . , 6. For a single throw we 
cannot predict which of these numbers we will obtain. We can only make predictions if 
we throw many times and keep track of the frequency with which we obtain the 
number « (« = 1 , 2, . . . , 6). The average number h is then given by 

6 

E nF„ 

h = — . (9.42) 

6 

IFn 

n = \ 



This average value can be predicted statistically (in the limiting case of an infinite 
number of throws) through the use of the concept of probability. This is the ratio of the 
number of times the desired result is obtained divided by the total number of attempts. 
The probability of obtaining n spots (“desired result” is n) is denoted by Since each 

6 

number of spots is equally probable, P 2 - . = Pe- Further, since li = 1 must 

A7=l 

hold, we use the equality of the individual probabilities to obtain immediately 

1/6, « = 1,2,...6. (9.43) 

(We exclude loaded dice.) According to probability theory, h may be expressed in terms 
of as follows: 

= (9.44) 

n = \ 

These relatively simple concepts may be applied directly to the definition of the 
mean value of position and of momentum in quantum mechanics. In general, we can 
make no definite predictions as to which position or which momentum will be meas- 
ured; we can only give probabilities. If we repeat the measurement of position or of 
momentum many times and calculate the mean value, the latter may be defined exactly 
as for the dice. The theoretician can, as we saw in the dice game, predict this mean 
value for the experimentalist. This mean value is therefore called the expectation value; 
it is defined as follows: Expectation value = Sum over the individual values measured, 
times the probability that that value would be found. 

Let us apply this definition to some examples. 

a) Mean Value of the Position (one-dimensional example). Fig. 9.3 

A single measurement yields the result that the particle is to be found in the interval 
x...x-\-dx. The corresponding probability is \ \i/{x)]^dx. Since the position x is con- 
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Fig. 9.3. Explanation of the mean value of the position. The loca- 
tion of a vertical line indicates the value of a measurement of the 
position coordinate x and the length of the line is proportional to 
the frequency with which that value is found (probability density). 
If we interpret the latter as a weight, the calculation of x corre- 
sponds to the calculation of the centre of gravity x of an object 



X (“Position) 



tinuously variable, while the number of spots on the die was discrete, we use an integral 
instead of the sum (9.44). The mean value of the position is thus defined as 



x= j xji/^(x)l^dx. 



(9.45) 



In the calculation of (9.45) and in the following, the normalisation of the wavefunction 
was assumed, i.e. 

Tli/^(x)l^dx=l. (9.46) 

— 00 

Correspondingly, we can take the nth power of x, and then generalise the definition 
(9.45) to obtain the mean value of the nth power: 

= \ x^\ \!/{x) \^dx . (9.47) 

— 00 

If we replace the function x^ quite generally by the potential energy function F(jc), we 
obtain the definition of the mean value of the potential energy, 

V = y V(x) I y/(x) \^dx . (9.48) 

— 00 

b) Mean Value of the Momentum (one-dimensional example). Fig. 9.4. 




P 



Fig. 9.4. Explanation of the mean value of the momentum. 
See caption of Fig. 9.3 
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In this case, we first take the wavefunction to be given by a superposition of plane 
waves: 

\j/(x) = f c{p) . (9.49) 

Yh 

If we now measure the momentum, the probability of finding its value in the interval 
p...p + dp is given by \c{p)\^dp. In complete analogy to the mean value of the 
position, we find the definition of the mean value of the momentum to be 

p= ^p\c{p)f'dp, (9.50) 



or for the nth power 

7" = i°°p'‘\c(p)\^dp. (9.51) 

— 00 

As we shall see later, wavefunctions are normally expressed as functions of 
position, in the form It is therefore difficult to calculate the expansion (9.49) in 
detail in order to determine the mean of the momentum, because the coefficients c(p) 
would first have to be calculated. We shall now show that there is a very simple com- 
putational rule which allows us to calculate the mean value of the momentum without 
following the indirect route via (9.49). 

We assert that the mean value of the momentum is given by the basic formula 

p = T (A 4-) ■ (9.52) 

- 00 \ 1 dx / 



The notation (h/i){d/dx) i//(x) may seem unfamiliar to the reader; it is a common form 
in quantum mechanics. It means that we are to differentiate y/(x) by x, that is, we 
calculate 



h dij/ 
i dx 



(9.53) 



This notation (9.52) is also referred to as applying the “momentum operator” 
{h/\){d/dx) to the wavefunction |^(x). The proof that (9.52) is the same as (9.50) is 
relatively simple, but requires some basic mathematical knowledge. We begin by sub- 
stituting (9.49) in (9.52). After differentiation with respect to x and exchanging the 
order of integration with respect to x and p, we obtain 



p= I dp \ dp'p'c*(p)c(p')— \ Qxp(-ipx/h)Qxp(ip'x/h)dx . (9.54) 

— 00 — 00 h — OO 



The underlined part, however, is merely the Dirac d function, S(p-p') (Appendix A). 
The definition of the d function eliminates the integration over p\ and leads top' = p, 
so that p' is replaced by p. We then obtain directly 
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p=^\dpp\c{p)\\ (9.55) 

— 00 

If we go through the calculation again in detail, we recognise that we have replaced the 
factor p in (9.50) by the differential operator {h/\)d/dx. In order to arrive at (9.51), we 
would have had to apply this operator n times to the wavefunction y/{x). 

c) Average Values of Energy 

Our results to this point enable us to calculate average energy values. The kinetic energy 
of a particle is The probability of observing the momentum p in the interval 

/7. . ./7 + C//7 is given by \c{p) \^dp. 

Thus the average kinetic energy is given by 

E^^=f\c{p)\^f^dp. (9.56) 

- 00 2 AWq 



If we use the computational rule discussed above, we immediately obtain 



■Ckin = in W* 






Into 



vV 



dxdydz. 



(9.57) 



where we have used the abbreviation 









■ + ■ 



a' 



■ + ■ 



8^ 






(9.58) 



and generalised the result to three dimensions. Equation (9.48) can be extended in the 
same way, which yields the expectation value for the potential energy: 



Epoi = 111 W*V(r) V dxdydz ■ 



(9.59) 



Since the total energy is equal to the sum of the kinetic and the potential energy, the 
expectation value for the total energy is, finally. 



E,oi= 111 W* 



n 1 

V^+F(r) 

2 mo 



y/ dxdydz . 



(9.60) 



9.3.4 Operators and Expectation Values 

With the help of the above results, we can now discuss the conceptual framework and 
the computational rules of quantum theory. In classical physics, we have certain me- 
chanical parameters, like the position x{t), momentum p{t), energy, etc. In quantum 
theory, these classical parameters are assigned certain expectation values [compare 
(9.45, 52, 60)]. These quantum mechanical expectation values can be obtained from 
classical physics by means of a very simple translation process according to the follow- 
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ing “recipe”: The classical parameters are assigned operators, which are nothing but 
multiplication or differentiation rules, which act on the wavefunctions following them. 
The position operator x is assigned to x{t), which simply says that one multiplies the 
wavefunction y/{x) by x. It may seem strange at first that a time-independent operator 
X can be assigned to a time-dependent parameter x{t). As we shall see below, however, 
the time-dependence is reintroduced in the process of finding the average, if the wave- 
function itself is time-dependent. The momentum is assigned the operator - ih{d/dx) 
which differentiates the wavefunction. After carrying out the appropriate operator 
multiplication or differentiation, one multiplies the result by >//* and integrates over all 
space to obtain the quantum mechanical expectation value. 

Using these rules, we can define still other operators which we have not yet con- 
sidered. One important parameter is the angular momentum /, which has the com- 
ponents lx, ly and /j. In classical physics, 4 , for example, is defined as xpy-ypx- In 
quantum theory we obtain the corresponding operator by replacing Px and Py by 
(^/i)0/0x and (^/i)0/0y, respectively. The z component of the angular momentum 
operator is thus 



HQ 0 

= -x y — 

0y 0x 



(9.61) 



In order to prevent confusion between the classical angular momentum and the angular 
momentum operator, we use here and in the following text the symbol "(read “hat”) 
over the angular momentum operator. 

The following table summarises what has been said above. 



Classical Operator Quantum theoretical 

variable Expectation value 



Position a:(0 



Momentum p{t) 



X 



X -\\i/*{xj)x\i/{x,t)dx 



— — — (Jordan’s rule) 
i dx 



p = \y/*{xj)^ -^\i/{x,t)dx 
1 dx 



Energy 

E = J^(x(t\p{t)) 






liriQ dx' 



- + V{x) 



£■ = j if/*{x, t) 



IrriQ 



dx^ 



y/{x, t)dx 



Angular momentum 
l=[rxp] 




I 




xj/dxdydz 



In the preceding discussion, we have given no consideration to the wavefunction y/, 
which has, so to speak, fallen from heaven. We must still consider the principles by 
which we can determine the wavefunction, in case it is not determined by the 
Schrodinger equation. 



9.3.5 Equations for Determining the Wavefunction 

We have already presented equations which were explicitly or implicitly applicable to 
the determination of y/. As the simplest example, let us take the plane wave 
y/ ~ exp(iA:x). As we already know, this wave determines the propagation of a particle 
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with momentum hk. Can we regard this plane wave as a solution of an equation which 
relates directly to momentum? This is in fact the case, because if we differentiate the 
plane wave with respect to x and multiply by h/i, we obtain the relation 

= (9.62) 

The plane wave thus satisfies an equation of the following form: The momentum 
operator {h/i)d/ dx applied to the plane wave yields p = hk times the plane wave. 

As a second example, let us consider the time-independent Schrddinger equation. 
The application of the Hamiltonian operator to the wavefunction gives an energy E 
times the wavefunction. A glance at the above table shows that the Hamiltonian is 
precisely that quantum mechanical operator associated with the classical energy expres- 
sion + EpQ^. 

When we extract what is common to these examples, we see that these functions are 
so-called eigenfunctions which satisfy the following equation: 

Operator • Eigenfunction = Eigenvalue • Eigenfunction . 

If we denote the operator by Q, the eigenfunction by 0 and the eigenvalue by <u, this 
relationship is 

Q<p = o)<l). (9 63) 

The eigenvalue indicated here and in Sect. 9.3.6 following should not be confused with 
a frequency. It can have quite different physical meanings, e.g. momentum. In the 
example (9.62), we had 

^ = ——, 0 = e ^ , co=hk. 

I dx 

We must now make use of a few basic facts of the mathematical treatment of such 
eigenvalue equations without being able to derive them here. As can be shown mathe- 
matically, eigenfunctions and eigenvalues are determined by (9.63), if appropriate 
boundary conditions for the wavefunction (eigenfunction) are given. One example for 
a set of boundary conditions is the particle in a box. If no explicit boundary conditions 
are given, we must require that the wavefunction be normalisable, which implies that 
the wavefunction must go to zero rapidly enough as infinity is approached. 

When the operator £2 in (9.63) and the boundary conditions are given, there is a 
particular sequence of eigenvalues, e.g. discrete energy values as in the particle in the 
box, etc. The calculation of these eigenvalues and the associated eigenfunctions is thus 
the task of mathematicians or theoretical physicists. In order to make them agree with 
experimental observations, one makes use of the basic postulate of the quantum 
theory: the eigenvalues are identical with the observed values. This basic postulate has 
enormous significance, and we can accept it, because it has been repeatedly confirmed 
in innumerable experiments. If we measure the energy of the electron in a hydrogen 
atom, for example, this must agree with the quantum mechanically calculated eigen- 
values E„. If there is a discrepancy, one does not impute this to a failure of quantum 
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theory, but rather looks for interactions which have not yet been taken into account. In 
this way, an excellent agreement has so far been attained. 

As we can see from our example (9.62), the Schrodinger equation is only one of 
many possible ways to determine the wavefunction. We are always concerned here with 
the physical problem. Thus whenever we use the Schrodinger equation, we would 
always assume that we have access to observations which measure the energy exactly. 
When we have then measured the energy, we have identified the associated eigenfunc- 
tions as solutions of the Schrodinger equation. We might also wish to measure the 
momentum. Since the wavefunction is known and, as one can easily demonstrate by 
Fourier analysis, this function contains several momentum eigenfunctions, we are no 
longer able to predict exactly the momentum of the particle, but can only calculate the 
expectation value. The simplest example for this is again the particle in the box. 



9.3.6 Simultaneous Observability and Commutation Relations 

As we saw above, there is a very close relationship between wavefunctions and eigen- 
values on the one hand and individual observations on the other. If a wavefunction is 
an eigenfunction for a particular operator - that is, if it satisfies an equation like 
(9.63) - then we know that the eigenvalue will be found by measurement. If we repeat 
this measurement, we shall find exactly the same eigenvalue. If follows from this that: 

If y/x is an eigenfunction of a specific operator Q, the eigenvalue a>x agrees with the 
expectation value Q. In fact, if we know the operator Q and the associated eigenvalue 
a>x, then 

^Wk=(OxH/}:.Q=\wt^Vxdx = \wto)xVxdx=cO},\ii/fVkdx=o)x- 

What happens, though, when we want to determine another parameter with the 
second measurement? One example for this was examined in more detail in Sect. 7.3, 
where we wanted to measure first the momentum and then the position of the particle. 
In that case, the measurement of position destroyed the results of the previous momen- 
tum determination. On the other hand, we can measure first the momentum and then 
the kinetic energy of a particle. In the first measurement, we obtain a certain value p. 
We have then “prepared” the particle into a particular state which is an eigenfunction 
of the momentum operator; the wavefunction after the measurement is thus (aside 
from a normalisation factor) given by exp(ipx/^). If we now measure the kinetic 
energy, this measurement corresponds to the mathematical operation of applying the 
kinetic energy operator, - {h^/2mo)d^/dx^. In the process, the “prepared” plane wave 
yields the eigenvalue E = p^/2mo, and the plane wave remains as wavefunction. In this 
case, the second measurement does not destroy the result of the first measurement. 
There exist, apparently, measurements which do not disturb each other, or, in other 
words, which can be simultaneously carried out with arbitrary accuracy. 

We will now derive a necessary criterion for simultaneous measurability. For this 
purpose, we consider the operators and which could, for example, be 
operators for the momentum and the kinetic energy. We now require that the wave- 
function y/ be simultaneously an eigenfunction of both characteristic equations 

(9.64) 

and 



y/ = y/ 



(9.65) 
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If we apply operator to the left side of the first equation and operator to the 
second equation, then subtract one equation from the other, rearrange, and finally 
apply (9.64) and (9.65) again, we obtain 

w = ^ = 0. (9.66) 

The simultaneous measurability of all wavefunctions which simultaneously fulfil (9.64) 
and (9.65), not merely special cases, should be guaranteed. Therefore the y/ in (9.66) is 
omitted in quantum theory, and one writes 



i2(i)f2(2)_ qO)q(\) ^ 0 _ (9 57) 

This, however, should be understood to be an abbreviation. When one sees such an 
equation, one should always remember that any desired wavefunction y/ stands to the 
right of the operators, i.e., (9.66) applies. It can be mathematically shown that the 
converse of the above is also true: if two operators and fulfil the commutation 
relation (9.67), then eigenfunctions of can always be determined to be eigenfunc- 
tions of as well; they fulfil (9.64) and (9.65). If there is only a single eigenfunction 
belonging to the eigenvalue of this is itself an eigenfunction of 
However, if there are several eigenfunctions of associated with then it will 
always be possible to find linear combinations of these which are also eigenfunctions of 

Let us consider a few examples. If we choose as the momentum operator 
(h/i)d/dx, and the kinetic energy operator ( - h^/2 mo) d^/dx^ as these operators 
commute. The result of differentiating a wavefunction twice and then once with respect 
to X is naturally the same as that of differentiating first once and then twice with respect 
to x: 



2 mo 



h / d 

i \ dx dx^ 



d^ d 
dx^ dx 



= 0 . 



(9.68) 



It can be shown in the same way that the x components of the momentum and the y 
components of the position mutually commute. 

Let us take as a second example the x component of the momentum and the 
coordinate x itself. Thus = (h/i)d/dx and = x: 

(ILJ^x-x- yy. (9.69) 

\idxidxj 



We now evaluate this expression. First we remove the parentheses: 



h d h dw 

= xy/-x — . 

\ dx i dx 



(9.70) 



d/dx means, of course, that everything to the right of the operator is to be differen- 
tiated, and 




9.3 The Conceptual Basis of Quantum Theory 



141 



d , ^ dx dw 

— {xw) = — xu^-x—^ . 
dx dx dx 



If we substitute this in (9.70), we obtain 



h 



If we again write out the right side of (9.69), we obtain the relation 



(9.71) 



(9.72) 



h d h d \ h 

— ^ !// = — (//. (9.73) 

\ dx \ dx ) 1 

Since this relationship holds for any function (//, one can also write in abbreviated form 

(9.74) 



h d h d 

x-x 

\ dx i dx 



h 

i 



This is the famous Heisenberg commutation relation between the momentum operator 
and the position operator. It says that the momentum and the position operators do not 
commute, which means that the position and momentum cannot be simultaneously 
determined to any desired degree of precision (see Sect. 7.3). 

The following formulation is often used to express the commutation relation 
between the two operators and 

(9.75) 



In this form, the Heisenberg commutation relation is 



h J__ 
i dx 






h 

i 



(9.76) 



We leave it to the reader to derive the following relations: 



A A V 

i dx ’ 



h dV 



i dx 



For the components of the angular momentum [compare the definition in (9.61)]. 



[^) ^y\ 1 ^^ » 

[A 4] = 

lA/}] = o, 



(9.77) 

(9.78) 

(9.79) 



j^x,y,z. 



(9.80) 
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These equations say that the components of the angular momentum are not simul- 
taneously measurable, although one component and the square of the angular momen- 
tum can be simultaneously measured. 



9.4 The Quantum Mechanical Oscillator 



Aside from the particle in the box, the harmonic oscillator is one of the simplest 
examples of quantum theory. Although this example does not apply to the motion of 
an electron in an atom, because a different force law applies there, the harmonic oscil- 
lator has innumerable applications in all areas of quantum physics. We shall return to it 
repeatedly. In classical physics, the equation of motion of the harmonic oscillator is 
given by = - kx (Fig. 9.5). The associated kinetic energy is {rriQ/2)x^ and the po- 
tential energy (k/2)x^. To convert this to quantum mechanics, we express the velocity x 
in terms of the momentum: moX = p, We also make use of the classical relation 
between oscillation frequency a>, mass and force constant, a>^= k/rriQ, In this way we 
obtain the following expression for the total energy (or mathematically expressed, for 
the Hamiltonian function): 

^ = -^ + — coV. (9.81) 

2wo 2 




Force* - kx 





Fig. 9.5. The harmonic oscillator. Above^ as an example, a point mass on a 
spring. Middle, force as a function of the displacement x. Below, potential 
energy as function of the displacement x 



The corresponding Schrodinger equation is 

(- ^ • (9.82) 

y Into dx^ 2 j 

As one can easily convince oneself, the energy E can only have positive values. We 

ensure this by multiplying both sides of (9.82) by and integrating from x = - oo 

+ 00 

to x= + 00 . The integral on the right side of (9.82), j \f/*y/dx, is positive, because 
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+ 00 

The same applies to the term containing x^, | {niQ/l) oP' x^\y/f‘ dx 

+ 00 

on the left side of (9.82). The remaining term, J [-(^V2 a«o) V'*’ d^\f//dx^]dx, is 
rearranged by partial integration to yield " °° 



h'^ ^dy/ dij/* , 

+ j —Z 1 _ dx 

Into dx x=-x 2w?o -«> dx dx 



(9.82a) 



If we substitute the limits of integration in the first term, it disappears, since we require 

+ " , 

that y/-*0 as x^±oo. (Otherwise the normalisation integral J \\j/\ dx= 1 would 

— QO 

not exist! ) The integral in (9.82a) is positive, and thus, so is the entire expression corre- 
sponding to the left side of (9.82). It now immediately follows that £ ^ 0. 

Since the Schrddinger equation includes several constants, we first carry opt a trans- 
formation to a new, dimensionless coordinate ^ and a new energy, by setting 



X = 



h 

mo CO 






E 

hco 



¥(x) = d>{0 . 
Then (9.82) becomes 



(9.83) 

(9.84) 



(9.85) 



If the differentiation operator d/d^ were an ordinary number, we could use the rule 
-a^+b^ = {-a + b){q+b). Although this is naturally not possible with operators, we 
shall use it as an heuristic aid and write, so to speak, experimentally. 



1 

w 








d^ 



+ ^0(0- 




(9.86) 



The order of the differentiation steps must be strictly observed here, that is, operators 
on the right must be applied before operators to the left of them. Let us now 
“multiply” out the parentheses, strictly observing the order of operations: 



1 

2 









di 



di 



HO- 



(9.87) 



This is the left side of (9.85), with an extra term. Just as we did with the Heisenberg 
commutation relation (9.69), we can apply the differentiation in the extra term to the 
wavefunction, and we obtain - 0(<^)/2 for the second expression in (9.87). Equation 
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(9.86) thus differs from the middle expression in (9.85) only by the term - 0/2. If we 
observe this and introduce, as shown in (9.86), the abbreviations b and the original 
Schrodinger equation (9.82) can be given in the form 

= (^- 1) 0 = (e - X) 0 . (9.88) 

In the following it is important to remember that b and b^ are only certain abbrevia- 
tions for operators, which are defined in (9.86). If we also substitute and 

provide the wavef unction 0 and this n with an index A, the justification for which will 
be given below, we finally obtain the Schrodinger equation in the form 



b^b(!)x. = nx0),. 



(9.89) 



The operators b and b ^ satisfy the commutation relation 

bb^-b^b=l . (9.90) 

We shall leave the proof of (9.90) to the reader as an exercise. One needs only to sub- 
stitute the definition of and b and then proceed as above with the Heisenberg com- 
mutation rule. 

Let us first consider (9.89) generally and multiply it from the left by the operator b, 
i.e., we apply the operator b to the left and right sides of (9.89). We then obtain 

bb^b0;, = n;,b0;,. (9.91) 

According to the commutation relation (9.90), we can substitute l-l-Z7'^Z7 for bb^ . 
When we do this with the first two factors on the left side of (9.91), we obtain 



Z? + Z?(Z? 0;t) + Z? 0;t = 0A » (9.92) 

or, if we combine the terms containing b^x on the right, 

b^b(b(/>x) = (nx-l)(b0x)^ (9.93) 

As we see, application of b to the wavefunction 0x produces a new wavefunction 
0 = (b0x) which satisfies (9.89), although its eigenvalue is 1 less: The 

operator b thus reduces the number nhy 1, We refer to it as an annihilation operator. 
Since, as we observed earlier, the energy E must be positive, n must have a lower limit. 
There must therefore be a lowest number n^ and a corresponding wavefunction 0 q for 
(9.89). If we were to repeat this formalism on the lowest eigenstate with A = 0, we 
would introduce a contradiction. We would have found a wavefunction with a still 
smaller eigenvalue, contrary to the assumption that 0o is already the lowest eigenstate. 
The contradiction is only resolved if Z?0o is identically equal to zero. Then (9.89) is 
fulfilled trivially for each value of n\ zero is, however, not a genuine eigenvalue. For 
the lowest state, we then have the condition 



Z?0o = 0 . 



(9.94) 
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If we replace b with the operator which it symbolises (9.86), then (9.24) is equivalent to 
(^+<j)0o = O. (9.95) 

This first-order differential equation can also be written in the form 

(9.96) 

4>o 

from which we obtain on integration 

ln0o= -y^^+C' , (9.97) 

or, taking the antilogarithm, 

0o=Cexp(-|^^). (9.98) 

The constant C must be determined by the normalisation condition. 



4 



3 

2 




3 

2 




b 



1 

n = 0 



n = 0- 



Fig. 9.6. Illustration of the effect of the creation and annihilation 
operators. Left: Application of means climbing up the “ladder” 
of states n = 0,\,. . . by one rung. Right: Application of b corre- 
sponds to climbing down by one rung 



We will now investigate what happens if we apply not the annihilation operator b 
but rather the operator on both sides of (9.89). By analogy to the steps (9.91 - 93), 
we obtain using (9.90) the relation 

b + b(b+())A) = («A+l)(b^0A) . (9-99) 

i.e. by application of to 4>x, we increase the eigenvalue by one unit. Therefore, b^ is 
called a creation operator (Fig. 9.6). If we choose the ground state for cpx, we obtain 
a proportionality 



0lOcb'^(/>O, 



and a second application of b ^ gives 
> 6tc. 




146 



9. The Mathematical Framework of Quantum Theory 



Here we have used a proportionality sign and not an equals sign, since we do not yet 
know whether the functions (pQ, (6“^)^0o> are normalised. In general, we obtain 

= (9.100) 



where the constant factor serves as normalisation factor. 

Since n always increases by an integer on application of but the lowest eigen- 
value is zero («o = 0), we may identify the index A with n. -Including the normalisation 
factor (which we will not derive here), C„ = \/]/^., we find the normalised wavefunc- 
tions: 

= — L.(6+)"0o. (9.101) 



Relation (9.101) still looks terribly abstract. We shall therefore show by means of 
several examples how the explicit wavefunctions may be derived; for this purpose, we 
shall leave the normalisation factor out of consideration. For a? = 0 we have already 
obtained Using (9.88, 83), we find for the lowest energy value 

Eq = h(o/2, the same zero-point energy which we have already discussed in Sect. 7.5. 
For n = i WQ obtain 



or, using the explicit expressions for and 0 o. 

01 oc -A + exp( - i(^^) . 

After carrying out the differentiation we have 
0ioc<jexp(-|<^^). 

The corresponding energy is 
E = {3/2)h(o. 

For n = 2v/e obtain 

02 oc &+ 01 oc A + . ^exp( - 1^2) ^ 

or, after differentiating, 

02«(2(^^-l)exp(-|^^) . 

For the energy we find 



E = (5/2)h(o. 




9.4 The Quantum Mechanical Oscillator 



147 



If we continue this procedure, we obtain polynomials through multiplication by ^ 
or differentiation with respect to In general, for the «th wavefunction we obtain an 
expression of the type 

= (9.102) 

where H„ is a polynomial which is known in the mathematical literature as a Hermite 
polynomial. The corresponding energy is given by 

E„ = (n + j^)^co, « = 0, 1 , 2 . . . (9. 103) 

(Fig. 9.7). For completeness, we shall give the formula for finding the Hermite poly- 
nomials. It states 



E 



n 




3 

2 

1 

0 



Fig. 9.7. The energy levels of 
the harmonic oscillator 



H„(0 



(- 1 )" 1 

l/^ ' ^417^ 



(9.104) 



If we return from the coordinate to the original coordinate x, the correctly normalised 
eigenfunctions of the Schrodinger equation of the harmonic oscillator are given by 



4 

= 1 



exp(- j-x^moco/^) • H„(x]/mocoZh ) . 
h 



(9.105) 



In Fig. 9.8, we have plotted the potential V{x). Furthermore, the energy levels 
(n + l/2)ho) are given along the ordinate, as are, finally, the wavefunctions them- 
selves. The first four wavefunctions in the energy scale are shown in more detail in Figs. 
9.9a, b. Although we will for the most part use the configuration representation y/{x) 
for the wavefunctions in this book, the creation and annihilation operators and b 
are indispensable in many areas of modern quantum theory. 





Fig. 9.8. A representation of the quantum mechan- 
ical harmonic oscillator which is often found in 
books. This figure contains three drawings in one: 1) 
The ordinate means total energy E. The horizontal 
lines (above the a: axis) give the quantised energy 
levels. 2) The ordinate gives the potential V{x). The 
dashed curve shows the shape of the potential curve 
as a function of position x. 3) Each of the horizontal 
lines serves as an x axis, on which the shape of the 
wavefunction of the corresponding energy is plotted 



i Fig. 9.9. a) The wavefunctions of the harmonic oscillator for n = 0,]. b) The wave- 
functions of the harmonic oscillator for n - 2,3 
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Problems 



9.1 By substituting the wave packet with co = hk^/(2mo) from Problem 7.1 into the 
Schrodinger equation, convince yourself that it is a solution for a particle which 
experiences no forces. 

9.2 Let the wavefunctions and 02 be solutions of the Schrodinger equation (9.35) 
with the eigenenergies E\ and £’ 2 - Show that 

y/(r, t) = Cl exp( - i£i t/h) 0i (/*) + C 2 exp( - lEit/h) 02 ('*) 

is a solution of the Schrodinger equation (9.32). What conditions must Ci and C 2 satisfy, 
in order to normalise y/{r, t)l Generalise this exercise to the wavepacket 

^(^ 0=1. Cj exp( - \Ejt/h) 0j{r ) . 

J 

Hint: 

\4>nr)(l>k{r)dV=djS 

(^ = 1 for j = k . 

9.3 The potential V{r) is represented in one dimension by - /]S(x), where d{x) is the 
Dirac d function (see the Mathematical Appendix). Solve the Schrodinger equation for 
bound states, i.e. for £ < 0. 

Hint: Solve the Schrodinger equation for x<0 and x>0, in other words where 
S(x) = 0. 

Where x = 0, the solutions found for y/_ and y/^ must join in a continuous fashion. 

Also, derive a second boundary condition (“jump condition”) for y/'_ and y/'^ by 
integrating the Schrodinger equation over - £<x<£, e-^0. Write the wavefunction so 
that it can be normalised, and find the normalisation constants and the energy. 

9.4 Find the bound states of a particle in a one-dimensional box, for which the 
potential is 



V(x) = 0 


for 


x< -L 


V(x) = - Ko<0 


for 


-L ^x^L 


V(x) = 0 


for 


x>L . 



Hint: Solve the Schrodinger equation in the three subregions. Require y/{x)^0 for 
x^ ± 00 ; y/(x) and y/'(x) are continuous atx= ±L. Display the eigenvalue spectrum 
for £ < 0, and discuss its dependence on L and Vq. 

9.5 Calculate the “scattering states”, in which £ > 0, for a particle moving in the S 
potential of Problem 9.3. 

Hint: Use the trial solution ^(x) = exp(i/:x) + t7exp(-i^x) for x<0 and y/ = b 
exp(i/:x) for x > 0, and determine a and b. What is the physical interpretation of this 
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trial solution in the field of wave optics? It does not need to be normalised. How do a 
and b change when the sign of p is changed, i.e., when the potential is repulsive? 

9.6 Let an otherwise free particle collide with an infinitely high potential barrier. 
What is its wavefunction (without normalisation)? 

9.7 For the one-dimensional wave packet of Problem 7.1, calculate the expectation 
values of the position x, momentum p, kinetic energy, and x^. Why is the expectation 
value of more informative than that of x? 

9.8 Express the energy expectation value for the wave packet of a free particle in 
Problem 7.1 in terms of the energy eigenvalues of the kinetic energy operator. 

9.9 Prove the commutation relations (9.77 - 80) for angular momentum. 

Hint: Use the quantum mechanical definition of the angular momentum operator and 
the commutation relations between position and momentum in three dimensions. 

9.10 Demonstrate the commutation relations between ^ and x, and between 4 and the 
central potential V{r) which depends only on r = |r | . 

9.11 The two functions i//] and y /2 are to vanish at infinity. 

Show that 



+ oo 

I \i/fxy/2dx = 

— 00 



+ 00 

I y/^xii/^dx 

— 00 



* 



Y ^ h d . rr * ft 

j WX — — ¥idx = \ y/i — 

-00 I dX \-0O 1 

\ y/X Widx = i \ y/^ y/idx\ . 

- oo \ - oo / 

The properties of the operators x,p = — — , jf^— + V(x), which are to 

i dx Into dx 

be proved here, indicate that these operators are Hermitian. 

Hint: Carry out partial integrations over d/dx and d^/dx^. 




9.12 Prove the Ehrenfest theorem 



d 

mo—x = p, 
dt 




dV 

dx 



for the one-dimensional quantum mechanical motion of a particle. 
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Hint: Use the definition of the operators x, p and dV/dx, and the fact that y/ (and (//*) 
satisfy a Schrodinger equation with the potential V(x). Make use also of the result of 
Problem 9.11. 

What is the expression for this theorem in three dimensions? 



9.13 Calculate the wavefunctions and energy values of a particle which is subjected to 
a force F= -kx + ko, {k = moco^). 

Hint: Set up V(x) and derive the new Schrodinger equation from the “old” one for the 
harmonic oscillator by means of a coordinate transformation. 



9.14 Prove the commutation relation (9.90) 
bb^-b^b = \ 

for the operators b and b^ of the harmonic oscillator. 

Hint: Use the definitions of b^ and b (9.86) and the commutation relation between x 

and — — (9.74). 
i dx 



9.15 Construct the wavepacket 



\l/= y/Q exp 




+ y/x exp 



. 3cu ^ 

1 1 

2 



from the first two states of the harmonic oscillator and examine the change in | ^ with 
time by means of a graphical representation. 



9.16 Let the Schrodinger equation of the harmonic oscillator be 
b^b(j)„ = n0„ (« = 0,1,2,...), 

where =(1/1/2) ^,b = (\/]/2) 0 = 0(<^). For6,6 + ,thecom- 

mutation relation [b,b*] = \ holds. 

Prove the following relations. The integrals extend from - oo to + oo. 

a) \[b^0m* 

b) \(b^0„)Hb^0n)d^=in + \)\0Und^ 

c) If 0„ is normalised, then = \/\/n + \b^ 0„ is also normalised. 
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d) The normalised functions <t>„ can be expressed as 

0„ = i/lAT(&+)>o> &0o = o. 

e) = + 60„ = 1A0 „-i. 

f) b(J}^T-(b^)"b = n{b^)''-\ 6 + (6)"- (*)"*+= . 

db 

d 

Hints: a) Use the explicit expressions for b* and b in terms of and partial inte- 

gration. 

b) Use a), the exchange relation and the Schrddinger equation. 

c) Follows from a). 

d) Mathematical induction method. 

e) Follows from d) and the commutation relations. 

f) Solve by the induction method (write bib^)" -{b*)"b as b{b^)" -(b"")"~^ b* b). 

9.17 Calculate the expectation value of the momentum, tfie kinetic energy and the 
potential energy for the «th excited state of the harmonic oscillator. 

Hint: According to (9.83 and 84), change from x to <^, transform ^ and d/dli into b* 
and b, and use 

\d>m(t>m{Od^= drr,n = 

«,m = 0,1,2, ... . 

9.18 Prove for the wavefunctions of the harmonic oscillator, 0„(^): 

\d>^iO(t>n{Od^=S,„n. 

Hint: Use the fact that 

0„ = ^r(6^)>O, 



for m^n 
for m = n , 



b0o = O, 



and the result a) of Problem 9.16. Proceed by induction. 



9.19 The bra and ket notation 

The English physicist Dirac introduced a very concise notation, especially for expecta- 
tion values and wavefunctions, which we shall demonstrate here for the case of the 
harmonic oscillator. 

Instead of one writes \n). The integral is presented as {n\n), and 

the expectation value \0niOb0niOd^ {n\b\n). Since < > is a “bracket”, <A 2 |is 
called “bra”, and \n) is called “ket”. Using the results of Problems 9.16 and 9.18, show 
that 
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a) 6+|«> = j/rt + 1 |« + 1> 
b \n) = \/n \n-\) 

b) <«|w> = (5„ 

c) {n\b\n) = 0 
{n\b* \n) = 0 

d) Calculate <«|(Z7‘'' + &)^|«>and (,n\{b^ -bf‘\n). 

What is the physical significance of these expectation values? 




10. Quantum Mechanics of the Hydrogen Atom 



10.1 Motion in a Central Field 

In this chapter, we shall solve the Schrodinger equation of the hydrogen atom. For our 
calculations, we will not initially restrict ourselves to the Coulomb potential of the 
electron in the field of the nucleus of charge Z, V{r) = - Ze^/(4neor), but rather will 
use a general potential F(r), which is symmetric with respect to a centre. As the reader 
may know from the study of classical mechanics, the angular momentum of a particle 
in a spherically symmetric potential field is conserved; this fact is expressed, for 
example, in Kepler’s law of areas for the motion of the planets in the solar system. In 
other words, we know that in classical physics, the angular momentum of a motion in a 
central potential is a constant as a function of time. This tempts us to ask whether in 
quantum mechanics the angular momentum is simultaneously measurable with the 
energy. As a criterion for simultaneous measurability, we know that the angular 
momentum operators must commute with the Hamiltonian. As we have already noted, 
the components ly, and of the angular momentum I are not simultaneously meas- 
urable; on the other hand, and /^, for example, are simultaneously measurable. A 
long but straightforward calculation reveals that these two operators also commute 
with the Hamiltonian for a central-potential problem. Since the details of this calcula- 
tion do not provide any new physical insights, we shall not repeat it here. 

In quantum mechanics as well as in classical mechanics, we may thus measure the 
total energy, the z component of the angular momentum, and the square of the angular 
momentum simultaneously to any desired accuracy. In the following, we shall there- 
fore seek the simultaneous eigenfunctions of P, 4, and We remind the reader that 
we denote the angular momentum operators by a " (hat), in order to distinguish them 
from the classical quantities /. Since we are here dealing with a spherically symmetric 
problem it is reasonable not to use Cartesian coordinates, but to change to another 
coordinate system which better reflects the symmetry of the problem. This is naturally 
the spherical polar coordinate system. If we choose a particular point x,y,z in 
Cartesian coordinates, we shall describe its position by means of the following coor- 
dinates (Fig. 10.1): 




y 



X 



Fig. 10.1. Illustration of spherical polar coordinates 
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1) its distance from the origin, r, 

2) the angle 6 between the z axis and the vector r, 

3) the angle 0 between the x axis and the projection of r on the x-y plane. 
Recalculating the Laplace operator = d^/dx^+d^/dy^+d^/dz^ in terms of 

spherical polar coordinates is a lengthy mathematical procedure, which however con- 
tributes nothing to the understanding of quantum mechanics. We shall therefore 
simply write down the kinetic energy operator in polar coordinates; it is 



with 




1 8 / 2 8 V 1 p 

IniQ Q'' \ 9''/ 2mor^ 






_J ^ 

sin0 60 



sin0- 



00 



1 8^ 
sin^0 00^ 



( 10 . 1 ) 



( 10 . 2 ) 



We note that the operator P is nothing other than the square of the angular 
momentum operator, and contains only derivatives with respect to angles. Since the 
potential of our problem depends only on the radius coordinate r, it is reasonable to 
separate the radial and angular functions in a trial wavefunction as follows: 

i//(r,e,<l>) = R(r)F(e,0), (10.3) 

i.e., we write the wavefunction as the product of a function which depends only on r 
with a second function which depends only on the angles 6 and 0. If we insert (10.3) 
into the Schrodinger equation 



V{r) 

2 mo 



y/ = Ei//, 



(10.4) 



we obtain 

Jf y/ = F(6, (p) 



Jp_ J_ 

2 mo P dr 



r^—]+ V{r) 
dr I 



R(r) + pF{d, 0) = ERF . 



2mor 



(10.5) 



We now make use of our recognition of the fact that the wavefunction (10.3) can be 
chosen to be an eigenfunction of P and of 4 as well as of We write the correspond- 

ing eigenvalues in the (arbitrary) form h^co and hm. These new, additional equations 
are then ^ 



PF{6, 0) = h^(DF{9, 0) and (10.6) 

l,F(e, 0 ) = hmF(e, 0 ), (10.7) 

Note that m in (10.7) is the “magnetic quantum number” and must not be confused 
with the mass. 



In literature one often uses Y instead of F. 
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By assuming that (10.6) is already solved, we can express the term R{r)/ 
(2mor^)PF{6,(p) in (10.5) in a simple form through the eigenvalue h^co. We then 
have eliminated all derivatives with respect to 0 or 0 on the left side of (10.5), and we 
may divide both sides of (10.5) by F{6, 0). We thus obtain an equation for the radial 
part R(r) alone: 



J_ A (pA. 

Into P dr y dr 



+ V(r) + 



h^co 

2mor^ 



R(r) = ER(r). 



( 10 . 8 ) 



We have reduced the task of solving the three-dimensional SchrOdinger equation 
(10.4), to that of solving the (as we shall see) simpler equations (10.6, 7 and 8). 

Since the quantity h^coin (10.8) is still an unknown parameter, which occurs as an 
eigenvalue in (10.6), our first problem is to determine this eigenvalue. We thus begin 
with the task of solving (10.6) and (10.7). 



10.2 Angular Momentum Eigenfunctions 

The first part of this section is somewhat more abstract. For the reader who would like 
to see the results first we give them here in compact form: 

The eigenvalues of the square of the angular momentum P are 

h^l(l+l), (10.9) 

where / is an integer, 

/ = 0 , 1 , 2 ... . 

According to (10.7), the eigenvalues of the z component of the angular momentum are 
hm . 

The integer m is called the magnetic quantum number, and takes on the values 

The wavefunctions F{6, 0) naturally depend on the quantum numbers / and m and 
have the form 

F,,J0,0) = e*'"^Pr(cos0). (10.10) 

These functions are drawn in Fig. 10.2. is called a Legendre polynomial, and Pf 
with m + 0 is called an associated Legendre function. The entire function (10.10) is 
called a spherical harmonic function. 

We first address ourselves to the task of finding the eigenfunctions F as the 
solutions to (10.6, 7). We write (10.6) again, giving the components of / explicitly: 



{P+i^+P,)F,,^ = h^co,F,,„. 



( 10 . 11 ) 




f the Hydrogen Atom 

z 
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Furthermore, we derive a new equation from (10.7) by applying the operator ^ to both 
sides and then using (10.7) once more. This yields 

= ( 10 . 12 ) 

If we subtract (10.12) from (10.11), we obtain 

{ti+rj)F,^^ = h\o>,-m^)F,^^. (10.13) 

If we multiply both sides of this equation from the left by and integrate over the 
coordinates 6 and 0, we can show, in a way similar to that used with the harmonic 
oscillator, that 



(10.13a) 

In analogy to the harmonic oscillator, it seems reasonable to write ^ as the 
product of two factors /+ = /^+i/^ and /_ = ily. We might suspect that these new 
linear combinations, like the operators and b with the harmonic oscillator, are a 
kind of creation and annihilation operators. As already stated in (9.77-80), the fol- 
lowing commutation relations hold between the angular momentum operators: 

[PJj] = 0 , j = x,y,z, 

Ux9 ^y\ — i » 

Uyi ~ > 

[l,JA=ihly. (10.14) 



Further commutation relations can be derived from these by simple algebraic trans- 



formations: 

[Pj±] = 0, (10.15)2 

[r„lj = ±hh, (10.16) 

[1^,!,] = (10.17) 

In order to demonstrate that /+ is a kind of creation operator and /_ a correspond- 
ing annihilation operator, we consider 

(10.18) 

To find an equation for this quantity, we apply /+ to the left of each side of (10.6) and 
then obtain, due to the commutativity with P, the equation 

P(f.F,,J = h^co,{hF,,^). (10.19) 



This means that if F/ ^ is an eigenfunction of (10.6), so is the function (10.18). We now 
apply /+ to the left of both sides of (10.7) and then, because of the commutation 
relation (10.17), we obtain after rearranging 

. ( 10 . 20 ) 



^ /+ means that (10.15) holds for both /+ and /_ . (10.16) and (10.17) are to be understood in the same way. 
In each case, the two upper signs belong together, as do the two lower signs . 
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/+ thus increases (or decreases) the eigenvalue m by 1 . Leaving off the normalisation 
factor, we can therefore write 

/+ Fi^m = Fi^rn±\' (Numerical normalisation factor) . (10.21) 

Equation (10.13a), which requires that cannot be larger than cu/, applies here. 
Therefore the series of new eigenfunctions l+Fi^^n must terminate at a maximum 
m = and at a negative, minimum m = mmm. Thus, just as in the case of the 
harmonic oscillator, we must require that 

=0 ( 10 . 22 ) 

and 

LF/^.=0. (10.23) 

If we apply f_ to the left of (10.22), and make use of the relations 

= / 1 + + kh = P- ur,± h) (10.24) 

and the fact that Fi^^ is an eigenfunction of P and 4 , we obtain the basic equation 
P ~ ^ i^l~ ^max~~ ^maix)Fi^rn^^y^ “ ® • (10.25) 

In analogous fashion, by applying /+ to (10.23), we obtain 

= ^^(CU/- Wmin + '«min)F/,„,^.^ = 0 . (10.26) 

Since the eigenfunctions Fi^f^ are not zero, the factors by which they are multiplied 
must vanish. It must therefore hold that 



^max(^max 1) “ ^min(^min 1^) (Oi» (10.27) 

This can be rearranged to 

(^max”l" ^min)(^max'“ 1) ~ ^ • (10.28) 

Since ^ it follows that the second factor in (10.28) must be different from 
zero, and therefore that the first factor is equal to zero. From this. 



^max ~ ^min * 



(10.29) 



As we have seen, each application of /+ to Fi^fn increases the eigenvalue m by 1 . There- 
fore the difference must be an integer. If follows from (10.29) that 



^max ~ 



integer 
2 " 



(10.30) 



So far we have only made use of the fact that Fi^^ satisfies the Eqs. (10.6) and 
(10.7), and that the commutation relations (10.14) apply. As we shall see later, we must 
require for the orbital motion of the electron that all values of m, and thus in particular 
mmaxj must be integers. Interestingly, the electron and also a few other elementary 
particles have their own angular momentum, which is independent of the orbital 
angular momentum, for which m^ax = y* This independent angular momentum is 
called “spin”. We shall return to it in Sect. 14.2.1. 
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If we set m^ax = h there are 2/ + 1 integers m between + / and - / which satisfy the 
condition 






(10.31) 



From (10.27) we know that the parameter a>i, which appears in (10.6), is 



(Oi —/(/+!), 



(10.32) 



The eigenvalue of the operator “angular momentum squared” is thus 



Wmax('Wmax+l)^ =/(/+l)^ • 



(10.33) 



With these results, we can give the original equations (10.6) and (10.7) with their 
exact eigenvalues 



hm~ • 



(10.34) 

(10.35) 



The application of 1+ to leads to a new function for which the normal- 
isation factor N remains undetermined: 



^/, m + \ ~ FA/ + Fl, m • 



(10.36) 



It can be shown that 



N = — , 

h l/(/-w)(/+m + l) 



(10.37) 



We again proceed in analogy to the harmonic oscillator. There we constructed the 
eigenfunctions in space, in that we applied the operators b and successively to the 
ground state. Here we do exactly the same. First, one can express the angular momen- 
tum operators, which were given in Cartesian coordinates according to (9.61), in polar 
coordinates. As can be shown mathematically, the result is 



^ _ h 8 
i 60 



(10.38) 



lx= - — ( sin0 + cot0cos0- 



(10.39) 



ly = — ( COS0 cot0sin0- 



(10.40) 



Using (10.35) and (10.38), we represent as the following product: 
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where we write the second factor on the right in the form 

Pr(cos0) (10.42) 

for later use. If we increase 0 by 2 tt, we must naturally obtain a single-valued function 
F/ This can only be guaranteed if m is an integer. Therefore the odd multiples of 1/2 
which would satisfy (10.30) are excluded. 

We now calculate for m= ~l from the condition (10.23). If we substitute 
(10.39) and (10.40) into this equation, we obtain in simple fashion 

The exponential functions can be removed from the second equation to give 

9//.-/W =/cot6>//_/(6>). (10.44) 

66> 

The solution of this differential equation is 

//,_/(0) = C(sin0)', (10.45) 

as the reader can be convinced by substitution. Here C must be determined by the nor- 
malisation. The condition 



90 



■ - / cot 0 



//,-/W = 0. 



(10.43) 



y ]\F\^s\nOded0=\ 



0 0 



(10.46) 



yields the coefficient C after carrying out the integration: 



C = 



1 1/(2/+ 1)! 



1/4^ /! 2 ^ 

If we now apply to F/ consecutively in the form 
9 



r+F,,„ = /ie'^ 



90 



- w cot 0 



P/,m > 



(10.47) 



we can construct all the angular momentum eigenfunctions. 

In the following, we give the expressions obtained thus for / = 0, 1, and 2. The func- 
tions Fi „, are normalised according to (10.46). They are given both as functions of the 
angul ar coordina tes 0 and 0 and as functions of the Cartesian coordinates x, y, z (with 
r = ]/x^+y^+z^); they are denoted in the standard notation by Y/ „,(0, 0). 



/ = 0 



Yo,o- 



1 



1/4^ 



(10.48) 
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10.3 The Radial Wavefunctions in a Central Field* 

Before we turn to the problem of hydrogen, let us consider the general case of an 
electron in a centrally symmetrical potential field V(r), of which we assume only that it 
vanishes at infinity. The starting point is then (10.8), which we repeat here: 

1 d (2 d\^ h^l{l+\) 

Into dr \ dr ) Im^r^ 



+ K(/-) /?(/■) = £■;?(/•). 



(10.51) 



Let us rewrite the underlined differential expression: 

d^ 2 d 

7 

dr r dr 

and multiply the equation by -IrriQ/h^ to obtain 

d^R 2 dR 

dr r dr 

where we have used the abbreviations 

^_2nto for £'<0 

I for E>0, 

K = l^V(r). 
h 




(10.52) 



(10.53) 



(10.54) 
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We shall now see what happens to the solution R(r) if we allow r to become very 
large. We begin with the function 



R = 



u(r) 

r 



(10.55) 



If we substitute this in (10.53), we obtain 



dr 



«(/•)+ 



A - V(r) - 



/(/+!) 



u{r) = 0 . 



(10.56) 



Since both V and 1/r^ go to zero at infinity, we neglect these two parameters. The 
remaining equation has two types of solution: 



1) £>0, i.e. A>0. 

In this case the general solution of (10.56) is 

M = Cie‘*^+C2e-‘*^'' (10.57) 

and thus the original solution 7? (r), according to (10.55), is 

/? = i_(ci €**'■+ cze-**"). (10.58) 

r 

To illustrate the meaning of this solution, let us imagine it to be multiplied by the 
time-dependence factor exp(-icu0 which would occur in the solution of the time- 
dependent Schrodinger equation. We see then that exp(iA:r) exp(-ia>0 represents 
a spherical wave propagating outwards, while r~^ exp(- iA:r) exp(- ia>0 is a spherical 
wave coming inwards. These spherical waves which come in from infinity and travel 
outwards again correspond to the hyperbolic orbits in the classical Kepler problem. 
Now let us investigate the case 

2) £'<0, i.e. ^ <0. 

Then the solution of (10.56) is 

w = Cle^''+C2e■'^^ (10.59) 

Since the solution naturally must not become infinite at large distances, which the 
exponential function exp(/cr) would do, we must require that the coefficient Ci = 0. We 
then obtain according to (10.55) a solution of the type 



i? = — e-''^ (10.60) 

r 

Since the absolute square of R represents the probability of finding the particle, and 
this quantity decreases exponentially for increasing r, we see that in (10.60), the 
electron is localised within a certain area in space. This is the quantum mechanical 
analogy to the closed elliptical orbits of classical physics (see Sect. 8.9). 
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We will now attack the problem of solving (10.51) for the case of a Coulomb potential 
Ze^ 

V=- . (10.61) 

47i6or 

For this purpose it is convenient to use dimensionless quantities. We thus introduce a 
new distance variable 

e = 2Kr (10.62) 

with K defined by (10.54). Corresponding to this, we introduce a new function R(g), 
which is related to R(r) by R(r) = R(2Kr) =R(g). We then divide (10.53) by (4«r^) 
and obtain: 




4 KQ Q / 



(10.63) 



in which we have used the abbreviation 



ntoZe^ 

h^4neo 



(10.64) 



The primes on R denote derivatives with respect to q. 

Having seen before that the wavefunction decays exponentially at large distances, it 
would appear reasonable for us to use an exponential function as trial solution. It will 
later prove useful to adopt the form 



R = e-^^^v(Q). 



(10.65) 



If we insert this trial solution in (10.63) and carry out the differentiation of the 
exponential function and of the function v{g), we obtain 




B A 1 /(/+!) 



v = 0. 



( 10 . 66 ) 



It is shown in the study of differential equations that (10.66) is satisfied by a trial 
solution in the form of a power series in g, which we will express in the convenient form 



v = g^ Ia^g''= lOyg 

v=0 v=0 



(v + ju) 



(10.67) 



in which it is assumed that ao =j= 0. 

In this expression, the exponent ju and the coefficients are still to be determined. 
We insert the trial solution (10.67) in (10.66), re-order according to powers of q, and 
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require that the coefficient of each power of q should be independently equal to zero. 
The lowest power which occurs is The corresponding coefficient is 

QQfjLiji— V) Q qT, — ciqI{1 V) = 0 , ( 10 . 68 ) 

Since we have assumed that Qq is nonvanishing, the common factor of must be 
zero, i.e. 

//(//+ 1) = /(/+!). (10.69) 

Of the two possible solutions ju = I and ju= - /- 1, only the first is usable for us, since 
the other solution leads to a function v which diverges at the origin (10.67), causing the 
trial solution for R also to diverge (10.65); however, we require the solutions of the 
Schrodinger equation to be well-behaved in the entire range. 

We now investigate the coefficients of the higher powers of ^(v=l=0). For we 

find 

^7,(v + /)(v+/-l) + a,2(v+/)-6r,/(/+l)-a,_i(v+/-l) + drv_i(AZ-l) = 0, (10.70) 

where the abbreviation 

n = B/k (10.71) 

has been employed. Relation (10.70) connects the coefficient with the preceding 
coefficient a^_x- We thus obtain from (10.70), after an elementary rearrangement, the 
recursion formula 






v+l-n 
v(v + 2 /+ 1) 



(10.72) 



This recursion relation permits two quite different types of solutions, depending on 
whether the chain of the cr’s is terminated or not. If it is not, the sum in (10.67) contains 
infinitely many terms, and it may be shown mathematically that then v{q) is practically 
equal to an exponential function which diverges at infinity. We must therefore restrict 
ourselves to the case when the chain of the a’s does terminate; this is in fact possible if n 
is an integer. We then obtain a cutoff for v = Vq, where 



Vq= n- 1 . 



(10.73) 



Since we must have Vq ^ 1, we obtain from this a condition for /: 



(10.74) 

In the following, we shall refer to n as the principal quantum number and to / as the 
angular momentum quantum number. According to (10.74), the angular momentum 
quantum number cannot be larger than n-\. 

We now calculate the energy value, which, as we will see immediately, is already 
determined in principle by our assumptions. For this purpose, we express E in terms of 
K (10.54); K is, however, already determined by (10.71). 
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In (10.71), as we have just seen, n is an integer, « = 1, 2, . . . . Furthermore, B is 
defined in (10.64). We thus obtain for E: 



J_ 



(10.75) 



If we think back through the whole derivation, we see that the energy values E came 
about through the requirement that the series (10.72) be terminated, or, in order to find 
the actual solution, that the wavefunction should vanish at infinity, n is allowed to take 
on integral values 1, 2, 3, . . . in (10.75), so that we obtain the energy level diagram of 
Fig. 8.4. The same energy values have already been derived in Chap. 8, starting with the 
Bohr postulates. 

For E>Q, i.e. for non-bound states, the energies form a continuous distribution of 
values. We will not give the corresponding wavefunction here. 

Since the series (10.67) has a cutoff, v{q) is a polynomial. If we recall the trial 
solution for R{q) and the abbreviation for q, (10.62), 



R = , 



(10.76) 



we finally arrive at an expression for the original R of the form 
Rnj{r) = K„r)r‘\}Jtl{2K„r ) . 



(10.77) 



The various quantities have the following meanings: 

the normalisation factor, which is determined by the condition 

]Rl,{r)r^dr=\. (10.78) 

0 

(The factor in the integrand results from the use of spherical polar coordinates.) 

K„ has the dimensions of an inverse radius and is given explicitly by (10.71, 64) 



1 moZe^ 
n h^Anso 



(10.79) 



is the mathematical symbol for the polynomial which occurs in (10.77), whose 
coefficients are determined by the recursion formula (10.72). It may be shown that 
^'n+x can be obtained from the so-called Laguerre Polynomials by (2/+ l)-fold 
differentiation: 



( 10 . 80 ) 

The Laguerre Polynomials, in turn, are obtained from the relation 

L„+,(e) = (10.81) 

A series of examples of (10.77) is given in Fig. 10.3 for various values of the quan- 
tum numbers. In Fig. 10.3 a, the radial wavefunction is plotted as a function of the 
dimensionless radius variable q (10.62). The parentheses (1,0), (2,0) etc. contain the 
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Fig. 10.3. a) The radial wavefunctions R(q) = R(r) of the hydrogen atom (10.77) are plotted vs the dimen- 
sionless coordinate g. The indices on the curves, (1,0), (2,0) etc. correspond to (/?,/) where n is the principal 
quantum number and / the angular momentum quantum number, b) The corresponding probability densities 
in the radial coordinate, i.e. 4ng^R^{Q) are plotted against the dimensionless coordinate o 
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values of n and / («,/). In Fig. 10.3b, Atiq^R^{q) is plotted for various values of n 
and /. R^(g)dQ gives the probability of finding the particle in a particular direction in 
space in the interval ^ . . . ^ + rfp. If, on the other hand, we wish to know the probability 
of finding the particle at the distance g in the interval g...g + dg independently of 
direction, we must integrate over a spherical shell. Since the volume of a spherical shell 
is just Ang^dg, we are led to the above quantity, ^ng^R^{g). The maxima of these 
curves is displaced to regions of greater distance with increasing quantum number n, so 
that here we see an indication, at least, of the classical orbits. 

Let us summarise our results. The wavefunction of the hydrogen atom may be 
written in the form 



Vn,i,m(r, 0, 0) = e'"’^Pr(cos 8)R„j{r) . (10.82) 

Here n is the principal quantum number, / the angular momentum quantum number, 
and m the magnetic quantum number or directional quantum number. These quantum 
numbers can assume the following values: 

A2= 1,2,... , 

(10.83) 

Some examples for the density distribution of the electron ( = probability density 
distribution | ii/nj,m{r, 0, 0) p) are represented in Fig. 10.4. 

The density of points shown here was calculated by computer. It represents the 
probability density of the electron. Since the hydrogen functions are partially complex, 
combination of functions which belong to and -m yields real functions. These 
linear combinations are also solutions of the Schrodinger equation of the hydrogen 
problem. They still have the quantum numbers n and /, but they are no longer eigen- 
functions for the z component of the angular momentum, so that this quantum number 
is lost. Figures 10.4a, b and e represent solutions with / = 0, which yield spherically 
symmetrical distributions. The sections c, d, f and g represent 1=1. Here one notices 
the dumbbell shaped distribution along one axis. There is a further linear combination 
possible in each case, but not shown here, in which the long axis of the dumbbell would 
lie along the third coordinate. Sections h and i represent 1=2, with m = 0 in h, and / 
represents a linear combination of m = ±1. Figure 10.4 does not show the wavefunc- 
tions with 1=2, m = ±2. 

The energy corresponding to (10.82) is given by (10.75). It clearly depends only on 
the principal quantum number n. Since each energy level (with the exception of 
n = 1) contains several different wavefunctions, these levels are called degenerate. This 
degeneracy is typical of the hydrogen atom problem with the Coulomb potential. 

The degeneracy with respect to / is lifted, i.e. the energy levels become dependent 
upon /, if the potential no longer has the form - const/r, but is still spherically sym- 
metric (Sect. 11.2). We will be led to consider effective departures from the Coulomb 
potential for all atoms with more than one electron (see below). The / degeneracy is also 
lifted even for hydrogen if we treat the problem relativistically, which is necessary for 
the exact treatment of the spectra (Sect. 12.11). The m degeneracy can only be lifted by 
superimposing a «OAi-spherically symmetric potential on the central potential of the 
atom, i.e. an electric or a magnetic field (Chaps. 13 and 14). 
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Fig. 10.4 a -i. The density distribution (= localisation probability 
density |^p) of the electron in the H atom represented by the 
density of dots (according to H. Ohno). The wavefunctions 
represented are: 

a) (10.82), « = 1, /=0, m = 0 

b) (10.82), n = 2J=0,m = 0 

c) (10.82), « = 2, /= 1, m = 0 



d) linear combination ( ^ 2 , i , i + i 



1) 



1/2 



e) (10.82), « = 3, / = 0, m = 0 

f) (10.82), « = 3, /= 1, m = 0 

g) linear combination iWi \ \ + \ _i)— 2— 

h) (10.82), « = 3, / = 2, m’=0 ’’ V} 

i) linear combination i \ + 2 _i)— ^ 

1/2 

The linear combinations given are also solutions of the Schrddinger 
equation of the hydrogen problem with the energies but they 
are not eigenfunctions of 
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Problems 

10.1 Calculate the expectation values of the kinetic and potential energies 

a) for the ground state of the hydrogen atom, « = l,/ = m = 0, 

b) for the wavefunctions n = 2, 1 = 0, m = 0 and n = 2, I = 1, m = ±1,0. 

Hint: Use spherical polar coordinates, so that for the volume element dV, 
dV= sin6d9d0r^dr. 

10.2 As will be discussed in Sect. 15.2.3, in quantum mechanics dipole matrix 
elements between two states with the wavefunctions and if /2 are defined by 

D = j y/fer y/idxdydz . 

Why is Z> a vector? Calculate the components of D when 

a) y/^=xi/^ = 

b) y/x = = ¥2,0,0 

or ¥i=¥ 2 ,\,o 

or ¥ 2 =¥ 2 ,\,±\^ 

Here y/nj,m is the wavefunction of the hydrogen atom with the quantum numbers n, I 
and m. 

10.3 Calculate k (10.79) and (10.75) numerically for the first three values of n for 
the hydrogen atom. 



10.4 Using the ground state of hydrogen as an example, we discuss here the variation 
principle of quantum mechanics. This says, in general, that the wavefunction y/ of the 
ground state of a Schrodinger equation .yfy/ = Eyj can be found (aside from solving the 
equation directly) by minimising the expectation value of the energy by a suitable 
choice of ^ = j y/^M'y/dxdydz = min. y/ must simultaneously satisfy the additional 
condition that \y/*y/dxdydz= 1. 

This principle can also be used to estimate wavefunctions, and especially energies. 

Problem: a) Take the trial solution y/ = Nqx ^{- Calculate the normalisation 
factor N. Then calculate £ as a function of ^o, and minimise ^ by a suitable choice of 
A*o. Then compare E^^^ with the exact value of the energy, 
b) Repeat the procedure for y/ = Aexp( - r/ro). 



10.5 Solve the one-dimensional Schrodinger equation 



tfV(-y) 

2 mo dx^ 



c\ , C2 



± y/(x) = Eyj 



for x>0, Ci>0, C 2 > 0 , £*<0. 
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Hint: First examine the limiting case oo and determine the asymptotic form of \i/{x). 
Then try the solution 



il/{x) = x'^e 



where o= y + |/-f+C2 with C2 = 2 atiq C2/h^ and e= - 2 ntoE/h^. 

Calculate g{x) with a power series which, however, must be terminated. Why? 




11. Lifting of the Orbital Degeneracy in the Spectra 
of Alkali Atoms 



11.1 Shell Structure 



After the spectra of atoms with only one electron, the next simplest spectra are those of 
alkali atoms. 

The alkali atoms have a weakly bound outer electron, the so-called valence electron, 
and all other (Z- 1) electrons are in closed shells. What the atomic-physical meaning of 
a closed shell is, we will discover later. At present we shall only say that even when 
several electrons are bound to a nucleus, their individual electron states can be charac- 
terised by the three quantum numbers n, I and m, but the corresponding energies are 
strongly modified, with respect to the one-electron problem, by the interactions of the 
electrons with each other. The Pauli principle (Sect. 17.2) says that a state characterised 
by specific values of n, I and m can be occupied by at most two electrons. In the ground 
state of an atom, the states with the lowest energies are naturally the occupied ones. A 
particular state of occupation of the energy levels or terms of an atom by electrons is 
called the electron configuration of the atom in that state - in this case, the ground 
state. A closed shell or noble gas configuration occurs whenever the next electron to be 
added would occupy the s state of the next higher principal quantum number n. It is not 
necessary that all the states belonging to lower principal quantum numbers be filled; 
more about this will be said in Chap. 20. The electrons in the closed shells are closer, as 
a rule, to the nucleus than the valence electron, and are more strongly bound. The total 
angular momentum of a closed shell vanishes. The closed shell is spherically sym- 
metrical and is especially stable. 

How is this known? Firstly, from chemistry: all alkali metal atoms have a valence of 
one. Each alkali metal is preceded in the periodic system by a noble gas, each of which 
has one electron fewer and has a particularly stable electron configuration - a closed 
shell. These gases are chemically inactive. Compared to those of their neighbours in the 
periodic table, their ionisation potentials are large. The neighbours with one more 
nuclear charge unit, the alkali metals, have very low ionisation potentials. For example, 
the ionisation energy of the noble gas helium is 24.46 eV. The next element in the 
periodic table, the alkali metal lithium, has an ionisation energy of only 5.40 eV. The 
ionisation energies of the heavier alkali metal atoms are even lower, as can be seen from 
Table 11.1. The table also shows that the ionisation energy for the removal of the 
second electron from an alkali metal atom is very large, because the electron configura- 
tion of the singly charged positive ion is a closed shell. In Fig. 11.1, the simplified term 
diagrams of the alkali metals are compared to that of the H atom. 

The comparison shows that in the alkali atoms, the / degeneracy is lifted. States 
with the same principal quantum number n and different orbital angular momentum 
quantum numbers / have different energies. Relative to the terms of the hydrogen 
atom, those of the alkalis lie lower - this means a larger (negative) binding energy - 
and this shift increases, the smaller / is. For larger principal quantum numbers, i.e. 
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Table 11.1. Work of ionisation for the elements with Z = 1 to Z = 20. Values are given for the neutral atom, 
and for singly, doubly and triply charged ions. The ionisation energy is always especially large for a noble gas 
configuration (closed shell). It is especially low if there is only one electron more than a noble gas configura- 
tion which is indicated by bold-face numbers 



Element 


Work of ionisation [eV] for the transition from the - 




neutral atom to 
singly charged 


singly to 
doubly charged 


doubly to 
triply charged 


triply to 

quadruply charged 


iH 


13.59 


_ 


_ 


_ 


2 He 


24.5 


54.1 


- 


- 


sLi 


5.4 


75 


122 


- 


4 Be 


9.3 


18.2 


154 


217 


sB 


8.3 


25.1 


38 


259 


6 C 


11.3 


24.5 


48 


64.5 


,N 


14.6 


29.6 


47 


77.4 


80 


13.6 


35.2 


55 


77.4 


9 F 


17.4 


34.9 


62.7 


87.3 


,oNe 


21.6 


41.0 


63.9 


96.4 


„Na 


5.14 


47.3 


71.7 


98.9 


aMg 


7.64 


15.0 


80.2 


109.3 


13AI 


5.97 


18.8 


28.5 


120 


14S1 


8.15 


16.4 


33.5 


44.9 


15P 


10.9 


19.7 


30.2 


51.4 


16 ^ 


10.4 


23.4 


35.1 


47.1 


I 7 CI 


12.9 


23.7 


39.9 


53.5 




15.8 


27.5 


40.7 


ca. 61 


19K 


4.3 


31.7 


45.5 


60.6 


2 oCa 


6.1 


11.9 


51 


67 
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Fig. 11.1. Simplified term diagrams for the alkali metal atoms, showing the empirical positions of the most 
important energy terms. The principal quantum number n is indicated by numerals, the secondary quantum 
number / by the letters 5, P, Z), and F. For comparison, the levels of the H atom are given on the right 
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greater orbital radii, the terms are only slightly different from those of hydrogen. Here 
also, however, electrons with small / are more strongly bound and their terms lie lower 
in the term diagram. This effect becomes stronger with increasing Z. We would like to 
understand this effect, at least qualitatively. 



11.2 Screening 



In order to understand the term diagrams of the alkali atoms, we will use the following 
model (Fig. 11.2): 




Fig. 11.2. Model of an alkali atom. The valence electron is screened 
from the nuclear charge +eZ by the (Z- 1) inner electrons 



A “valence electron” is located at a relatively large distance r from the nucleus. It 
moves in the electrostatic field of the nuclear charge + eZ, which is for the most part 
screened by the (Z-1) inner electrons. We describe the screening effect of the inner 
electrons together with the nuclear potential by means of an effective potential Feff(r) 
for the valence electron. In this way we reduce the original many-body problem to a 
single-particle system, and we can treat the energy levels of an alkali atom as terms of a 
single-electron atom. 

The shape of the effective potential Feff(r) is shown schematically in Fig. 11.3. If 
the valence electron moves at a great distance from the nucleus, its potential energy is 
-eV(47T£o^)- The nuclear charge which attracts the valence electron is in this case 
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compensated down to one unit of charge by the inner electrons. However, the nearer 
the valence electron approaches the nucleus, the more it experiences the unscreened 
nuclear potential. The potential energy approaches V = -Ze^/iAne^r), The effective 
potential Feff(r) is no longer proportional to r~^. This proportionality was, as we recall 
(Chap. 10), responsible for the / degeneracy. 




Fig. 11.4. Variation in screening for electrons with 
different radial probability densities. In the Som- 
merfeld model it is intuitively clear that electrons 
with “diving orbits” approach the nucleus closely 
and are affected at least partially by the unscreened 
nuclear potential. It has to be remembered, 
however, that in the modern quantum theory, the 
electrons are described as charge clouds rather than 
as orbiting particles 



In the Sommerfeld picture, the so-called diving orbits demonstrate especially clearly 
that electrons with differing orbital angular momenta, i.e. differing orbital shapes, 
experience different degrees of screening. This is illustrated in Fig. 11.4. Quantum 
mechanically, this picture remains valid to a large extent. In Chap. 10 it was shown that 
the probability density of the electrons in the neighbourhood of the nucleus decreases 
in the order / = 0, 1 , 2, . . . . The s electrons are thus most strongly affected by the un- 
screened field of the nucleus. For a given principal quantum number n, the energy 
terms of the s electrons are therefore shifted the most strongly to negative values 
relative to the H atom (Fig. 11.1). 



11.3 The Term Diagram 



For the alkali atoms, we thus obtain a term diagram like that shown in Fig. 11.5 for 
lithium. This term diagram permits a classification of the spectral lines to series, if one 
employs the additional selection rule for optical transitions Al= ±1, i.e. in an optical 
transition, the quantum number / must change by 1 . Such selection rules will be treated 
in detail in Chap. 16. 

The series in the emission spectra of the neutral alkali atoms can be described by 
series formulae similar to the Balmer series formula. For the energy terms / which 
are determined by the quantum numbers n and /, an effective principal quantum 
number «gff may be defined, so that, e.g. for sodium we have 



Efi^i 



niff 



. — — 



1 ] 
[n-Ain,l)f ) 



Here the multiplication by the factor he is necessary if the Rydberg number i?Na is 
measured in cm~\ as is customary. Here nQff=n-A{n,l) is a principal quantum 




Energy E [eV] 
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Fig. 11.5. Term diagram of the lithium atom 
with the most important transitions. This is 
called a Grotrian diagram. The term symbols 
given above the figure will be explained in 
detail in Chaps. 12 and 17. At the right, for 
comparison, the term scheme of hydrogen is 
shown 



number, in general not an integer, and A{n,l) = n- «eff is the so-called quantum defect 
associated with the quantum numbers n and /. The empirically determined numerical 
values for the quantum defects (see Table 1 1 .2) are largest for 5 electrons, decrease with 
increasing orbital angular momentum quantum /, and are largely independent of the 
principal quantum number n. They increase down the column of alkali atoms from 
lithium to cesium, or with increasing nuclear charge number Z. These quantum defects 
are empirical expressions of the different degrees of screening of the 5, p, d, etc. elec- 
trons by the electrons of the inner shells. 

For the sodium atom, the decomposition of the total spectrum into series is repre- 
sented in Fig. 11.6. Figure 11.7 shows the transitions in the form of a Grotrian 



Table 11.2. Quantum defects A(n,l) for the spectra of the Na atom [from F. Richtmyer, E. Kennard, 
J. Cooper: Introduction to Modern Physics, 6th ed. (McGraw-Hill, New York 1969)]. These are empirical 
values 





Term 


« = 3 


4 


5 


6 


7 


8 


/ = 0 


5 


1.373 


1.357 


1.352 


1.349 


1.348 


1.351 


1 


p 


0.883 


0.867 


0.862 


0.859 


0.858 


0.857 


2 


d 


0.010 


0.011 


0.013 


0.011 


0.009 


0.013 


3 


f 




0.000 


-0.001 


-0.008 


-0.012 


-0.015 
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Wavelength K [nm] 



250 300 400 600 800 1000 2000 




5 4 3 2 1 0 



Energy E [eV] 

Fig. 11.6. The three shortest-wave spectral series of the sodium 
atom. The series limits are indicated by shading. The emission spec- 
trum is a composite of these series. In absorption spectra, normally 
only the principal series is observed, because in the ground state of 
the Na atom the highest occupied term is the 35 term. The yellow 
colour of sodium lamps is due to the longest-wave resonance line of 
the main series, the transition 3s<^3p. This is the sodium D line, a 
terminology which has been retained for historical reasons 



Fig. 11.7. Term scheme (Grotrian diagram) of the sodium atom. 
Some of the shortest-wave transitions from the principal series, the 
two secondary series and the Bergmann series have been included. 
The numbers in the diagram indicate the wavelength of the transi- 
tion in Angstrom units. The term symbols indicated on the upper 
edge of the figure also represent the quantum numbers for the 
multiplicity and the total angular momentum. These are explained 
in Chaps. 12 and 17 





Wavenumber v [10'^ cm 
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and the Bergmann (fundamental) series with transitions from/ to d electron terms: 



[n-A(n,W 



n ^ «o+ 1 



is again the Rydberg number of the sodium atom and «o is the integral principal 
quantum number of the lowest state. This is 2 for Li, 3 for Na, 4 for K, 5 for Rb and 6 
for Cs. We are jumping slightly ahead in saying that the valence electron of the alkali 
atoms begins a new shell in each element. The principal quantum number of the ground 
state therefore increases by one in each successive alkali element of the periodic system. 

The names for the series and the system of indicating the electrons with orbital 
angular momentum 0, 1, 2, 3, 4, ... as s,p, dj, ^. . . are historic, p is for principal, 5 for 
sharp, d for diffuse and / for fundamental. 

Under normal conditions, only the principal series is observed by absorption spec- 
troscopy, because unless the temperature is extremely high, only the ground state of the 
atoms is sufficiently populated for transitions into higher states to be observed. The 
lines of the principal series are thus resonance lines. The best known is the D line of 
sodium, which is the transition 3s- 3p. The sum of the 5 terms can also be designated 
S, and of the p terms, P, so that the sodium series can be written: 



Principal series 3 S 
Secondary series 3P*^nS 

3P*^nD with n^3 . 



Capital letters are used for terms which apply to several electrons in an atom, and 
lower case letters for the terms for individual electrons. In the alkali atoms, which have 
only one valence electron, the two notations are equivalent. 

The screening effect of the inner electrons can be quantitatively calculated, if one 
knows their charge distribution with sufficient accuracy. Qualitatively, we wish to 
demonstrate the effect of the nuclear charge on a single 3d or 4s electron in the atoms 
H(Z=l)andK(Z=19). 

In the H atom, the charge cloud of a 3rf electron is, on the average, closer to the 
nucleus than that of a 45 electron (Fig. 11.8). Therefore, the 3d electron is more 
strongly bound to the H atom. It is different, however, in the K atom. The configura- 
tion of the atomic core, i.e. the noble gas configuration of Ar, consists of two 5 elec- 
trons with « = 1 (symbol l5^), two s electrons with n = 2 (symbol 25^), six electrons 




0 4 8 12 16 20 24 28 32 36 

Bohr radii r 



Fig. 11.8. Radial probability densities for a 
45 and a 3t/ electron in the H atom. The 45 
electron is, on the average, farther from the 
nucleus, but the probability of finding it near 
the nucleus is greater than the probability of 
finding the 3 d electron there 
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with n = 2 (symbol 2/?^), two s electrons with n = 3 (symbol 3 5'^) and six p electrons 
with n = 3 (symbol 3p^): 

\s^2s^2p^3s^3p^ - or [Ar] for argon . 

Now the question is, does K, with one more electron, have the configuration 
[Ar] 45"^ or does [Ar] 3d^ have a greater binding energy? Is the 19th electron added as a 
As ox di 3d electron? 

From our consideration of the H atom, we would predict that the [Ar] 3d^ con- 
figuration is more stable. However, it must now be determined what the effects of 
shielding are on the 3d and 45 electrons. Because the 45 electron has a higher prob- 
ability of being very close to the nucleus, and thus unscreened, it turns out that the 45 
energy level is energetically somewhat lower than the 3 d. The 20th electron is also an s 
electron; see Table 3.1 . The element following potassium in the periodic table, calcium, 
has the configuration [Ar] 45^. It thus becomes clear how decisively the screening 
affects the binding energies of the outer electrons, in a manner dependent on the orbital 
angular momentum quantum number /. 




wavelengths for a few transitions (in nm) are indicated. The term symbols on the upper edge of the diagram 
are explained in Sect. 12.8 and Chap. 17. Right, Term scheme for the potassium atom in the infrared, visible, 
ultraviolet and x-ray regions. The term symbols used in this diagram are explained in Chaps. 12 and 17. (One 
should observe that the energy ranges in the two halves of the figure are different.) The x-ray spectrum also 
includes terms with lower principal quantum numbers than the visible spectrum. Terms with « = 1, 2, 3 . . . 
are referred to in the x-ray region as the K,L,M . . . shells, see Chap. 18 



Energy E [eV] 




Problems 



179 



11.4 Inner Shells 

So far we have treated only the optical spectra of the alkali atoms. The valence electron 
could only have the principal quantum number n^lfox Li, ^3 for Na, ^4 for K, etc. 
The states with lower principal quantum numbers were completely occupied. Transi- 
tions involving these inner electrons were not discussed. However, they are also 
possible. Since the inner electrons are more strongly bound, such transitions take place 
at higher energies. We will introduce such transitions later, in the discussion of x-ray 
spectra (Chap. 18). 

Figure 11.9 shows, in addition to the optical term scheme for the valence electron of 
the K atom, a complete term scheme. This includes the transitions in the x-ray region of 
the spectrum, in which an electron is removed from a closed inner shell and replaced by 
an electron from further out. 



Problems 

11.1 The energy levels of the valence electrons of an alkali atom are given, to a good 
approximation, by the expression 

E„ = -Rhc- \/[n-A(,n,l)\^ . 

Here A(n,l) is the quantum defect (which depends on the values of n and / of the 
valence electron in question). For lithium and sodium, A(nJ) have been measured: 





5 


P 


d 


Li (Z = 3) 


0.40 


0.04 


0.00 


Na (Z=ll) 


1.37 


0.88 


0.01 



Calculate the energy of the ground state and the first two excited states of the valence 
electron in lithium and sodium. 

11.2 The ionisation energy of the Li atom is 5.3913 eV, and the resonance line 

(25 2/7) is observed at 6710 A. Lithium vapour is selectively excited so that only the 

3 p level is occupied. Which spectral lines are emitted by this vapour, and what are their 
wavelengths? 

Hint: Start from the fact that the quantum defect is independent of «, the principal 
quantum number. 

11.3 Explain the symbols for the 3 transition in sodium. How many lines 

can be expected in the spectrum? 
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12.1 Introduction and Overview 

We have not yet discussed the magnetic properties of atoms. It turns out that the study 
of these properties yields a deeper insight into the shell structure of atoms. 

The impetus to study the magnetic properties was given by a few fundamental ex- 
periments, which we shall discuss in this chapter. The most important are 

- Measurements of the macroscopic magnetisation and of the gyromagnetic proper- 
ties of solids, known as the Einstein-de Haas effect. 

- Measurements of directional quantisation and of the magnetic moments of atoms in 
atomic beams, made by Stern and Gerlach. 

- Observation of the so-called fine structure in the optical spectra of atoms. 

We shall begin with the third point. Many of the lines in the spectra of alkali atoms 
are double, and are called doublets. They occur because all the energy terms E^j of 
atoms with single valence electrons, except for the s terms (energy levels with no orbital 
angular momentum), are split into two terms. This splitting cannot be understood in 
terms of the theory discussed so far. It is fundamentally different from the lifting of 
orbital degeneracy discussed in the last chapter. If the orbital degeneracy has already 
been lifted, there must be a new effect involved, one which has not yet been taken into 
account. Let us take as an example the D line in the spectrum of the sodium atom, i.e. 
the transition 3 P ^ 3 S (Fig. 11.7 and 12.1). With sufficient spectral resolution, one can 
see two lines: Dj = 589.59 nm A 16956 cm~^ and D 2 = 588.96 nm ^ 16973 cm"^ In the 
following we shall often use this pair of lines as an example for explanation and experi- 
mental demonstration of spectroscopic results. Like the Balmer series of the H atom, 
the sodium D lines are especially suitable for demonstration of basic concepts in atomic 
spectroscopy - so much so, that they have become the “guinea pigs” of the field. 




Na lamp 



Arrangement 
of prisms 



Fig. 12.1. Arrangement for spectral separation of the two com- 
ponents Dj and D 2 of the sodium D line. With this arrangement, 
the splitting can easily be demonstrated in the lecture hall by 
replacing the screen with a television camera. To separate the 
lines distinctly, one needs two commerically available straight- 
through prisms 




Fig. 12.2. Arrangement for observation of the sodium D lines by absorp- 
tion (as the so-called Fraunhofer line). The continuous spectrum of an arc 
lamp or xenon high-pressure lamp is spread out by a prism. A gas flame 
made yellow by addition of NaCl, or better still, sodium vapour from a 
heated piece of sodium metal, absorbs the light of the D line from the 
continuous spectrum. On the screen, therefore, the line is seen as a black 
band on the continuous spectrum 
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To explain the doublet structure, one needs three additions to our previous picture: 

- A magnetic moment fXi is associated with the orbital angular momentum L 

- The electron also has a spin s. It too is associated with a magnetic moment, 

- The two magnetic moments Ps pi interact. They can be parallel or antiparallel to 
each other. The two configurations have slightly different binding energies, which 
leads to the fine structure of the spectrum. 

Two demonstrations of the yellow sodium lines are shown in Figs. 12.1 and 12.2; 
other experiments follow in Chap. 13. 



12.2 Magnetic Moment of the Orbital Motion 

An electron moving in an orbit is equivalent to a circular electric current. We know 
from electrodynamics that a circular electric current generates a magnetic dipole field. 
We expect that the orbiting electron will do the same, and it does in fact have a 
magnetic dipole moment. This we shall now calculate. 

The magnetic dipole moment of a conducting loop is defined as 

H = IA [Am^], (12.1) 

where / is the current, and ^4 is a vector which is perpendicular to the plane of the con- 
ducting loop and which has a magnitude equal to the area enclosed by the loop. Thus 
the vector p is also perpendicular to the plane of the loop. 

If we bring this magnetic dipole into a homogeneous magnetic field B, a torque t is 
applied to the dipole: 



r = pxB, (12.2) 

The magnetic potential energy of the dipole is (Fig. 12.3) 

a 

^mag= I Tda= -/xBcosa, (12.3) 

n/2 

where a is the angle between p and B, 




Fig. 12.3. Calculation of the potential energy of a con- 
ducting loop in a magnetic field. The magnetic moment 
is the product of the current / and the area vector . The 
potential energy depends on the angle a between the 
normal to the plane of the loop and the direction of the 
magnetic field 
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The magnetic moment can be defined either in terms of the torque in the field (12.2) 
or the potential energy (12.3). 

In atomic and nuclear physics, the magnetic moment is often defined as the torque 
in a uniform field of strength H (not of strength B). Accordingly, 

T = ft'xH, ft' = hqIA , (12.4) 

if we indicate magnetic moments which are defined w.r.t. H by ft'. Because of the 
relation B = juoH, the induction constant = “Itt • 10“’ Vs/Am occurs in (12.4). 




Fig. 12.4. Calculation of the orbital moment. The circulating electron has 
an angular momentum / and a magnetic dipole moment ///. For a negative 
charge, the vectors / and /// point in opposite directions 



We now wish to transfer the definition of the magnetic dipole moment to the atom 
and calculate the magnetic moment of an electron with the charge q= -e which is in a 
circular orbit moving with the velocity v (Fig. 12.4). If the time for a single revolution is 
T = 2 tt/cu, a current 



I 



0 ^ 

T 



eoj 

2n 



(12.5) 



is flowing. Here we have used e (without a sign) for the elementary unit of charge. Here 
and in the following, we use a negative sign for the electron. 

The magnetic moment fx of this circular current is then, according to (12.1), 

H = lA = - \eu>r^ . (12.6) 

If we introduce^ the orbital angular momentum |/| = mvr = mcoP', we can rewrite 
(12.6) as a relation between the magnetic moment and the orbital angular momentum 



/!=-—/. (12.7) 

Into 

If the charge q is positive, the vectors and / point in the same direction; if it is 
negative, as with the electron, they point in opposite directions. Therefore (12.7) holds. 
We have introduced the symbol /Wq to make it clear that the rest mass is what is meant. 



^ The orbital angular momentum is given by /, and its magnitude by | / 1. This is to prevent confusion with the 
quantum number / of the orbital angular momentum. See also (8.28) 
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The proportionality between angular momentum and magnetic moment is also 
known as the magnetomechanical parallelism. The fact that it is valid in atoms is by no 
means self-evident, and follows from experimental observations which will be dis- 
cussed below. 

As the unit of magnetic moment in atoms, we use the strength of the moment which 
corresponds to an electron with the orbital angular momentum |/| = h/ln. This is the 
orbital angular momentum in the first Bohr orbit of the hydrogen atom in the old Bohr 
model. An electron with |/| = h/2n or h produces a magnetic moment given by the 
Bohr Magneton: 

;«B = — ^^ = 9.274078- (12.8) 

2mo 



It is an unfortunate - but, because of its wide usage, unavoidable - inelegance 
that the same symbol // is used both for the magnetic moments // and //g and for the in- 
duction or permeability constant of vacuum, //q- 

The magnetic moments of electrons are frequently given in units of /Yb. For the 
magnitude of the magnetic moment of an orbit with the angular momentum quantum 
number /, the following expression is valid: 

;«/ = ^bI//(/+1) = ^^1//(/+1). (12.9) 

2 mo 

This expression is also valid for vectors, in the form 

Mi= ( 12 . 10 ) 

h 

Equation (12.10) thus defines the g factor, which we shall often meet in the following. 
It is dimensionless and here has the numerical value = 1 . It is a measure of the ratio 
of the magnetic moment (in Bohr magnetons) to the angular momentum (in units of h). 
It was introduced by Lande, in the presence of spin-orbit coupling (Sects. 12.7, 8), in 
order to characterise the ratio of the magnetic moment (in jn^) and the total angular 
momentum (in units of h). 

With “angular momentum”, we often denote - briefly but inaccurately - the 
quantum number /, i.e. the maximum component in the z direction, l^/h. The maxi- 
mum component of fx in the z direction is then given by (//^)max = We will treat 

the g factors for other cases of the angular momentum later. They are always defined 
as the ratio of the magnetic moment to the corresponding angular momentum, in units 
of /iB and h, respectively. 



12.3 Precession and Orientation in a Magnetic Field 

An applied field with the magnetic flux density Bo acts on the orbital magnetic mo- 
ment fii by trying to align the vectors /// and Bo parallel to one another, since the 
potential energy is a minimum in this orientation (12.3). The electrons, which are mov- 
ing in their orbits, behave mechanically like gyroscopes and carry out the usual preces- 
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sion about the direction of the field. The precession frequency (Op of a gyroscope 
under the action of a torque t is 



|/|sina 



( 12 . 11 ) 



where I is the angular momentum of the gyroscope, and a the angle between the 
directions of / and Sq (Fig- 12.5). 




Fig. 12.5. Vector diagram for the calculation of the precession frequency of 
a gyroscope with angular momentum / and magnetic moment //. The angle 
between the field Bq and the direction of / (and //) is denoted by a; the vectors 
Al and t are perpendicular to / and B 



These considerations may be directly transferred to the case of the atomic 
gyroscope. The precession frequency of the electron orbit, the Larmor frequency, is 
found from (12.10) and (12.11) to be 






1^1 

|/|sina 



I / I sin Of h 



( 12 . 12 ) 



The new quantity y which we have introduced here is called the gyromagnetic ratio. It 
gives the precession frequency in a field with a magnetic flux density of 1 Vs/m^ = 1 
tesla. The sign and direction of the vectors is indicated in Fig. 12.6. As can be seen from 
(12.12), the Larmor frequency col is independent of the angle a. 

We have already seen that the orientation of the vector / in space is not random. The 
solution of the Schrodinger equation (Sect. 10.2) implies that when one axis is 
established, a component of the angular momentum is quantised. This axis can be 




Fig. 12.6. Directional quantisation: Only the projections of the vectors / 
and fii on a chosen axis z can be observed. Here the z direction is the 
direction of Bq 
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determined by a magnetic field, for example. Therefore only discrete values of a, the 
angle between B and / or /i/, are allowed. 

According to Sect. 10.2, the following holds for the components of the angular 
momentum in the z direction: 

lz = mih, with m/=0, ±1...±/. (12.13) 

Here rrii is used instead of m in Sect. 10.2. In this way we emphasise that m{ = mi) is 
associated with the orbital angular momentum, m/is the magnetic quantum number. It 
can have 2/+1 different values. Here / is again the angular momentum quantum 
number, \l | = \/l(l+\)h. The largest possible component of / in the z direction thus has 
the value / • h. 

The magnetic moment jii associated with the orbital angular momentum is corre- 
spondingly quantised. For its component in the z direction the quantisation rule is 

fj'hz = h = - ■ (12.14) 

2 mo 

The maximum value in the z direction is As a simplification (but not 

accurately), it is said that the state has the magnetic moment h ju^. 

Since // precesses around the direction of B, it is intuitively clear that in an observa- 
tion of the energy of interaction between the magnetic moment and the magnetic field, 
the X and y components of ju are averaged out over time. However, the z component 
can be observed. 

The experimental demonstration of the existence of a directional quantisation was 
provided by the Stern and Gerlach experiment (see Sect. 12.6). 



12.4 Spin and Magnetic Moment of the Electron 

The s states with orbital angular momentum / = 0 have no orbital magnetic moment. 
Therefore, a one-electron atom should be diamagnetic in the ground state, when it has 
one valence electron in an outer shell and all the others in closed shells. However, these 
atoms are actually paramagnetic. 

The reason is the existence of electron spin and the associated magnetic moment. 
Electron spin was introduced by Uhlenbeck and Goudsmit in 1925 to explain spectro- 
scopic observations. 

The splitting of many spectral lines in a magnetic field, which will be discussed later 
(the anomalous Zeeman effect) can only be explained if the electron has a spin angular 
momentum s, 



|5|= l/s(5+l)^ (12.15) 

and the associated magnetic moment 
e 

2 mo 



Ms = -9s 



(12.16) 
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where e is again the unit charge of the electron, without the negative sign. 5 = 1/2 is a 
new quantum number, the spin quantum number. The similarity of (12.16) and (12.10) 
is apparent. The two expressions differ only in that (12.16) contains the new factor 
the so-called g factor of the electron. Although the expected value for this propor- 
tionality constant on the basis of classical theory would be 1 , the value has been empir- 
ically determined to be = 2.0023. Figure 12.7 represents the spin and magnetic 
moment of the electron schematically. 




Isl- /sls+1) h 



-e 



e 

^ls*-9s — s 

2m 




Fig. 12.7. Spin and magnetic moment Fig. 12.8. The electron spin has two possible orientations in 

of the electron a magnetic field in the z direction. These are characterised 

by the quantum number m^= ±1/2 



Dirac showed in 1928 that the spin of the electron is a necessary consequence of a 
relativistic quantum theory (the Schrodinger theory is non-relativistic). The g factor 
= 2 could also be thus derived. The slight difference between the predicted value of 2 
and the empirical value can only be understood if the interaction of the electron with its 
own radiation field is taken into account through quantum electrodynamics. 

As first shown by the experiment of Stern and Gerlach (Sect. 12.6), the spin can 
only have two orientations in an external magnetic field B (or in the presence of a 
defined z axis): “parallel” and “antiparallel” to the field (Fig. 12.8). Its components in 
this defined z direction are 

Sz = m^h with m^= ±\\ (12.17) 

m^ is the magnetic quantum number of the spin. 

It follows from the orientation of the angular momentum that the magnetic 
moment is also oriented. The z component is 

Ms,z= (12.18) 

or numerically, 



fis,z= ± 1 . 001 16 //B- 



(12.19) 
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Intuitively speaking, the spin and the magnetic moment precess around the field 
axis, leaving the z component constant (compare Sect. 12.3). 

The gyromagnetic ratio, which was defined above (12.12) as the ratio between the 
magnetic moment and the angular momentum. 



y 



M 

|/| 




( 12 . 20 ) 



is thus not the same for orbital (12.10) and spin (12.16) magnetism. For pure orbital 
magnetism, 



and for pure spin magnetism, 



mo 



The previously mentioned g factor is also used instead of the gyromagnetic ratio y. 
g is obtained by multiplying y by ^ and is defined for pure orbital magnetism as 

= ^ —h = gi/i^ ( 12 . 21 ) 

2 mo 

and for pure spin magnetism by 

= 1.00116 — ;j = ^s//B = 2.0023 ;Ub. (12.22) 

mo 



In the following, the reader will see that the easiest and most definitive way to 
calculate the magnetic properties of atoms is often to make use of measurements of the 
ratio y or g. 



12.5 Determination of the Gyromagnetic Ratio 
by the Einstein-de Haas Method 

The gyromagnetic ratios of macroscopic samples can be measured as shown in 
Fig. 12.9. An iron needle is magnetised by a coil. If one changes the magnetisation of 
the sample - and this means changing the direction of the atomic magnetic moments 
in the sample - one will also change the direction of the atomic angular momenta, and 
this must be observable as a change in the angular momentum of the whole sample, 
according to the law of conservation of angular momentum. If the magnetisation is 
changed by 180° by reversing the poles of the coil, the angular momentum vector must 
also be rotated through 180°. Quantitatively, the change ZI//n in the magnetisation of 
the needle, measured with a detection coil and a ballistic galvanometer, can be repre- 
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Fig. 12.9. Einstein-de Haas experiment. When the current in the coil is 
reversed, the magnetisable bar hanging in it turns. The torsion of the 
fibre on which the bar is suspended is measured with a mirror and light 
beam 



sented as the sum of the changes for the individual electrons, 

n 

Y,Afi^ = n-2n^, 

1 

if n electrons have reversed their directions. 

Likewise, the macroscopic change in the angular momentum of the needle, 
measured by means of the torsion fibre, is the sum of the changes of the atomic angular 
momenta: 

lAl, = n-2l,. 

1 

For macroscopic samples, the measured ratio 

Aju^ _ jUz _ e 
ZILn Iz ^0 

Thus according to the definition of (12.20), 

e ^ 

y = or g = 2. 

mo 

From this experiment it can be seen that there is an angular momentum associated 
with the magnetism of atoms, and that it can be calculated as derived above. 

In general, gyromagnetic ratio measurements, first described in 1915 by Einstein 
and de Haas, can indicate how much of the magnetism in a given sample is due to spin 
and how much to orbital angular momentum. However, a quantitative understanding 
of this type of measurement requires a deeper knowledge of solid state physics. There, 
it is found that in a solid, the orbital magnetic moment is often “quenched”, i.e. it ap- 
pears to be vanishingly small. 



12.6 Detection of Directional Quantisation by Stern and Gerlach 

In 1921, the deflection of atomic beams in inhomogeneous magnetic fields made 
possible 

- the experimental demonstration of directional quantisation and 

- the direct measurement of the magnetic moments of atoms. 
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S] 

Expected 

classical 

result 



In the experiment (Fig. 12.10), one first generates a beam of atoms. In the first 
experiments of Stern and Gerlach, this was a beam of silver atoms which was generated 
in an atomic beam furnace and collimated by a series of slits. Later, hydrogen atoms 
from a gas discharge were also used. The collimated beam passes through a highly in- 
homogeneous magnetic field, with the direction of the beam perpendicular to the direc- 
tion of the field and of the gradient. The directions of the field and gradient are the 
same. Without the field, the vectors of the magnetic moments and angular momenta of 
the atoms are randomly oriented in space. In a homogeneous field, these vectors 
precess around the field direction z. 

An inhomogeneous field exerts an additional force on the magnetic moments. The 
direction and magnitude of this force depends on the relative orientation between the 
magnetic field and the magnetic dipole. A magnetic dipole which is oriented parallel to 
the magnetic field moves in the direction of increasing field strength, while an anti- 
parallel dipole moves towards lower field strength. A dipole which is perpendicular to 
the field does not move. 

The deflecting force can be derived from the potential energy in the magnetic field 

„ dB dB 

Fz = t^z —r~ ^ ^ ~T~ ^ ’ (12.23) 

dz dz 

where a is the angle between the magnetic moment and the direction of the field 
gradient. 




Fig. 12.10. Stern-Gerlach experiment. The atomic 
beam passes through an inhomogeneous magnetic 
field. One observes the splitting of the beam into two 
components 




Fig. 12.11. Stern-Gerlach experiment. Observed 
intensity distribution of an atomic beam with 
and without an applied magnetic field, ^Sj /2 
state [from H. Kopfermann: Kernmomente, 2nd 
ed. (Akademische Verlagsgesellschaft, Frank- 
furt 1956)] 
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In classical mechanics, any orientation a of the atomic magnet with respect to the 
field is allowed. Atoms with magnetic moments perpendicular to the field gradient are 
not deflected. Those in which the vectors are parallel are deflected the most, and all 
possible intermediate values can occur. In the classical picture one thus expects a con- 
tinuum of possible deflections. With H and Ag atoms, however, two rather sharp peaks 
separated by 2d were observed on the detector (Fig. 12.11). 

This experiment and similar measurements on other atoms permit the following 
conclusions: 

- There is a directional quantisation. There are only discrete possibilities for the 
orientation relative to a field Bq, in this case two, parallel and antiparallel. 

- From a quantitative evaluation of the observed deflection 5 in the above example, 
one obtains the value ±fi^. In general this method provides observed values 
for atomic magnetic moments if the magnitude of the field gradient is known. 

- For all atoms which have an s electron in the outermost position, one obtains the 
same value for the deflecting force, from which it follows that the angular momenta 
and magnetic moments of all inner electrons cancel each other and one measures 
only the effect of the outermost s electron. 

- The s electron has an orbital angular momentum 1 = 0 and one observes only spin 
magnetism. 

- Like gyroscopes, atoms maintain the magnitude and direction of their angular mo- 
menta in the course of their motion in space. 

This experiment provides the basis for the knowledge of the angular momenta and 
magnetic moments of atoms which was summarised in Sects. 12.2 and 12.3. 



12.7 Fine Structure and Spin-Orbit Coupling: Overview 

In the introductory section to this chapter we mentioned that all energy terms - with 
the exception of the s states of one-electron atoms - are split into two substates. This 
produces a doublet or multiplet structure of the spectral lines, which is denoted by the 
generic name fine structure. 

The fine structure cannot be explained with the Coulomb interaction between the 
nucleus and the electrons. Instead, it results from a magnetic interaction between the 
orbital magnetic moment and the intrinsic moment of the electron, the spin-orbit 
coupling. Depending on whether the two moments are parallel or antiparallel, the 
energy term is shifted somewhat. 

The magnetic coupling energy between the orbital moment and the spin moment 
will be calculated in Sect. 12.8. The coupling of the magnetic moments leads to an 
addition of the two angular momenta to yield a total angular momentum. 

The following conclusions are then valid (Fig. 12.12): 

- / and s add to give a total angular momentum y; 

- j has the magnitude |/y(/-l- l)fi withy = |/±s|, i.e. y = |/±^| for the case treated 
here of a single-electron system with s = \. The quantum number y is a new quan- 
tity: the quantum number of the total angular momentum. We shall show with a 
quantum mechanical calculation in Sect. 14.3 thaty has the magnitude given above. 

- For a p electron with / = 1, s = ^, we find the following possibilities: 
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y = 3/2, |y I = 1/15/2 /i, and 
y = l/2, \j\ = \/yiTr, 

- when 1 = 0, j = s and there is no doublet splitting; 

- for j, just as for I, there is a directional quantisation. The z components must obey 
the condition 

= mjh, mj=j,j-\,...-j (2j + 1 possibilities) . 




Fig. 12.12. Coupling of the vectors of spin 5 and orbital Fig. 12.13. Directional quantisation: for the 

angular momentum I to give the total (resultant) angular z component of the angular momentum y , 

momentum j in the vector model. The vectors s and / only certain discrete values are allowed, 

precess about the resultant y. In a magnetic field applied They are denoted by the magnetic quantum 

in the z direction, y precesses about z. The opening angle number ruj. For the case illustrated, j = 3/2, 

of the cone of precession is determined by the magnetic the magnitude of the vector is [/ 1 = 

quantum number rUj. The figure shows the case 5 = 1/2, ]/(3/2)(5/2) h. Four orientations are al- 

I =2 J= 5/2 lowed: rrij = 3/2, 1/2, - 1/2, - 3/2 

For example, a state with j = 3/2 is fourfold degenerate (Fig. 12.13). 

- A magnetic moment juj is associated with y; this will be calculated in Sect. 13.3.5. 

- For optical transitions, a selection rule Jy = 0 or ± 1 is valid; however, a transition 
from y = 0 to y = 0 is always forbidden. This selection rule may be considered to be 
an empirical result, derived from the observed spectra. The reasons for it will 
become clear later (Chap. 16). 



12.8 Calculation of Spin-Orbit Splitting in the Bohr Model 

In this section, we shall calculate the energy difference between the parallel and the 
antiparallel orientations of the orbital angular momentum and the spin. For this 
purpose, the simple Bohr model will be used as starting point; the quantum mechanical 
treatment will be discussed in Sect. 14.3. 

The motion of the electron around the nucleus generates a magnetic field Bi at the 
site of the electron. This field interacts with the magnetic moment of the electron. To 
determine the magnitude of this magnetic field, we borrow from relativity theory and 
assume that the electron is stationary and that the nucleus moves instead (Fig. 12.14). 
We replace the position vector for the orbiting electron, r, by the vector -r. 
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Fig. 12.14. For the calculation of spin-orbit 
coupling, the system with the nucleus at rest 
(left) is transformed to the system with the 
electron at rest (right). The vector r is replac- 
ed by the reversed vector - r 



The magnetic field of the moving charge +Ze is found from the Biot-Savart law 
to be 

5,= +Z£^[^x(-r)] (12.24) 

47rr 

or 

(12.25) 

4;rr 

Angular momentum is defined as I = r x niov or —l = moVXr. Then 

B,= /, (12.26) 

AnrmQ 

where rriQ is the rest mass of the electron. 

The magnetic field which is generated by the relative motion of the nucleus and the 
electron is thus proportional and parallel to the orbital angular momentum of the 
electron. We still require the back transformation to the centre-of-mass system of the 
atom, in which the nucleus is essentially at rest and the electron orbits around it. A 
factor 1/2 occurs in this back transformation, the so-called Thomas factor, which can 
only be justified by a complete relativistic calculation. The particle in its orbit is 
accelerated, and from the viewpoint of the proton, the rest system of the electron 
rotates one additional time about its axis during each revolution around the orbit. The 
back transformation is therefore complicated and will not be calculated in detail here. 

The magnetic moment of the electron, and with it, its coupled spin vector, precess 
about the magnetic field produced by the orbital motion (cf. Fig. 12.15). 




Fig. 12.15. Precession of the spin about the magnetic field Bi associated 
with the orbital angular momentum, with the components and ^ 
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The interaction energy between the spin and the orbital field is thus 

-Ms-B,. 

Substituting - see (12.16), Qs = 2 - we find 
2 mo 



and with (12.26) 



Ze^Ho 

Snmor^ 



(s-l). 



(12.27) 



Here we have included the (underived) Thomas correction; this gives the factor 8 in 
the denominator (instead of 4). 

In order to get a feeling for the order of magnitude, we set Z = 1 and r = 1 A and 
obtain T) j = 10“^ eV. The field produced by the orbital motion at the position of the 
electron is found to be about 1 tesla = 10"^ gauss. The fields associated with the orbital 
angular momentum are thus - for small values of Z - of the same order of magnitude 
as may be readily produced in the laboratory. 

Equation (12.27) may also be written in the form 



F/s = = -^ |/||^ |cos(/,s) (12.28) 

h 

where a = Ze^fiofi^/(S nmoP). The scalar product I • s may be expressed in terms of the 
vectors / and s by using the law of cosines according to Fig. 12.16, where we recall that 
must be replaced by its quantum value /(/+ \)h^, etc. We thus obtain for the spin- 
orbit coupling energy 

F/,s = ^(|/+5p-|/p-kl') 

2h 

= y[/0+ !)-/(/+ l)-5(5+l)] . (12.29) 




Fig. 12.16. Vector addition of the angular momentum vectors to the 
total angular momentum j, explanation of (12.29) 



The spin-orbit coupling energy is thus expressed in terms of the quantum numbers j, I 
and 5', as well as a constant a, known as the spin-orbit coupling constant. The latter is 
directly measurable by determination of the doublet structure in the optical spectra. 
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Comparison with (12.27) shows that the orbital radius r is included in this coupling 
constant a. We must remember, however, that there are no fixed orbits in the quantum 
theoretical description of the atom. Therefore it is necessary to replace by the corre- 
sponding quantum theoretical average \/r^ = \{\\f/^/r^)dV, where if / is the wavefunc- 
tion of the electron and dV the volume. 

If we use the radius r„ of the «th Bohr radius as a rough approximation for r. 



Aneoh^n^ 

Ze^rriQ 



(12.30) 



we obtain 




If instead we use the above-defined average value r we obtain for atoms similar to H 



«'/(/+!)(/+!) ’ 



(12.31) 



which will not be derived here. 

Let us again summarise what we know about the fine structure of one-electron 
states: 

- Interaction of the electron with the orbital angular momentum or the orbital 
moment splits each level into two. The result is doublet levels; for example in the 
upper state of the sodium D lines, the 3P state is split into the 'iP \/2 and the 'iPy 2 
states (Fig. 12.17). 




Fig. 12.17. Fine structure splitting of the P state in the one elec- 
tron system into the two states P 3/2 and P\/ 2 - The magnitude of 
the splitting is given by (12.29). Since only one electron is in- 
volved, one can also use lower case “/?” 



- For s terms there is no splitting, because there is no magnetic field with which the 
spin can align itself. 

- Levels with higher values of the quantum number j have higher energies (12.29). 

- The fine structure splitting is proportional to the fourth power of the nuclear 
charge. 

The fine structure is therefore difficult to observe in the H atom. For the H^, H^, 
and Hy lines of the Balmer series (6562.79, 4861.33 and 4340.46 A), the splitting is 
0.14, 0.08 and 0.07 A, respectively. This corresponds to a wavenumber of 0.33 
cm~^ for the H^line, which is in the microwave range - if one wished to observe it 
directly. A direct observation of the splitting of optical spectral lines into two very 
close components is not possible by conventional spectroscopy because of Doppler 
broadening of the lines. In the lithium atom, the splitting of the first resonance line 
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is likewise only 0.337 cm“^ By contrast, the observed values for a line pair of the 
1st primary series, i.e. for the first resonance lines, of cesium (Z = 55, n = 6) are 
A = 8943 A and 8521 A . The splitting is thus AA = 422 A or Zl v = 554 cm" ^ It is 
so large, in fact, that the two lines are difficult to recognise as components of a pair. 
The sodium atom (Z = 11, n = 3) lies between these extremes: the yellow D lines 
Di and D 2 are separated by Zl A = 6 A, which corresponds to 17.2 cm" ^ 

- The splitting is greatest for the smallest principal quantum number n (12.31). 

We can now expand upon the symbolism needed to identify the energy terms of 
atoms. The terms for orbital angular momentum are generally indicated by upper case 
letters S, P, D, F, etc. The principal quantum number n is written as an integer in front 
of the letter, and the total angular momentum quantum number j as a subscript. The 
multiplicity 25+ 1 is indicated by a superscript to the left of the orbital angular momen- 
tum letter. For single-electron systems, the terms are doublet terms, because the spin of 
the single electron can have two orientations with respect to the orbital angular momen- 
tum. 

The S terms are not split. Nevertheless, one writes the multiplicity 2 even for S terms 
in one-electron systems. 

One thus has the following symbols: 

2 ^Si /2 for a state in which the valence electron has the quantum numbers n = l, 
l=0,j=\/2, 

1 for states in which the valence electron has the quantum numbers n = 2, 
2^^372 I 1= \ ,j = 1/2 or 3/2, respectively. 

In general, the symbolism is j. The upper case letters S (spin quantum number), 

L (orbital angular momentum quantum number) and J (total angular momentum 
quantum number) apply to several-electron atoms, while the corresponding lower case 
letters apply to single electrons. 



12.9 Level Scheme of the Alkali Atoms 

For an atom with one electron in the incomplete outer shell, the results of Sect. 12.7 can 
be summarised in the term scheme of Fig. 12.18. This figure should make it clear that 
both the lifting of orbital degeneracy (i.e. the energy difference between terms with the 

S P D F 

- 7/2 — 

5/2 



Fig. 12.18. Term scheme for alkali atoms, i.e. one-electron states, 
including the spin-orbit splitting. The levels are not shown to scale. 
A few allowed transitions are indicated. The terms are displaced 
with respect to those of the H atom (/i = 2, 3, 4, left side, dashed 
lines), the s terms most. The fine structure splitting decreases with 
increasing values of n and / 
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same n but different / quantum numbers) and spin-orbit splitting become smaller as the 
quantum numbers n and / increase. 

The optical transitions in the term scheme obey the rules zl/=±l,Zly=±lorO. 
Optical transitions are thus allowed only if the angular momentum changes. The total 
angular momentum y, however, can remain the same. This would happen if the orbital 
angular momentum and the spin changed in opposite directions. 

The first principal series of the alkali atom arises from transitions between the 
lowest ^5 i/ 2 term (i.e. n = 2, 3,4, 5, 6 for Li, Na, K, Rb, Cs) and the P terms '^P\/i and 
^P 3 / 2 . Since the S terms are single- valued, one sees pairs of lines. The same holds for the 
sharp secondary series, which consists of transitions between the two lowest P terms 
n^P ^/2 (/? = 2, 3, 4, 5, 6 for Li, Na, K, Rb, Cs) and all higher ^S \/2 terms. The lines of the 
diffuse secondary series, however, are triple (Fig. 12.19), because both the P and the D 
terms are double. 




Fig. 12.19. Allowed and forbidden transitions 
between P and D states of the alkali atom, here for a 
triplet of the diffuse secondary series. This is a 
section from Fig. 12.18 



12.10 Fine Structure in the Hydrogen Atom 



Since the wavefunctions of the H atom are known explicitly (Chap. 10), its fine struc- 
ture can be exactly calculated. The starting point is the expression derived above (12.27) 
for the spin-orbit splitting energy: 






Snml 



r 



(12.32) 



We use the solution of the non-relativistic Schrodinger equation for the H atom, 
which provides the energy states E^j (Sect. 10.4). For the H atom, both the relativity 
correction (cf. Sect. 8.10) and the fine structure interaction are small compared to the 
energies but the two are of comparable magnitude. One can therefore calculate 
the two corrections separately and write 



^nj,j - ^n,/ + ^rel + ^/,s • 

The two correction terms, the one for the relativistic mass change F’rei the other for 
the spin-orbit coupling s, together give the fine structure correction E^^. These terms 
will not be calculated in detail here. The complete calculation was carried out by Dirac. 
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As a result, one obtains 



£fs= , (12.33) 

n y+1/2 AnJ 

where 

4neohc \ Anh / 

which is the Sommerfeld fine structure constant which was introduced in Sect. 8.10. 

By including the spin-orbit coupling, one thus obtains the same result as earlier 
(Sect. 8.10) in the calculation of the relativistic correction, the only change being that / 
has been replaced by y. The energy shift with respect to the previously calculated energy 
terms £'„,/is of the order of i.e. (1/137)^ and is thus difficult to measure. 

The most important result of (12.33) is the fact that the fine structure energy of the 
H atom depends only on y, not on /. This means that terms with differing / quantum 
numbers (for the same n) and the same quantum number y have the same energies; they 
are energetically degenerate (Fig. 12.20). 



S P D 




Fig. 12.20. Fine structure splitting of the 
states with « = 1 , 2 and 3 (far left, dashed 
lines, the levels without fine structure), 
according to Dirac (not to scale). The fine 
structure shifts are indicated by open arrows. 
States with the same / are degenerate without 
fine structure interactions. States with the 
samey have the same energy if fine structure 
is taken into account 



The fine structure of the hydrogen lines is thus quantitatively accounted for. The 
fine structure energies of heavier atoms are larger and are thus easier to observe. Their 
calculations, however, are far more difficult, because the exact calculation of the wave- 
functions of atoms with more than one electron is far more complex. 



12.11 The Lamb Shift 

In the years 1947 - 1952, Lamb and Retherford showed that even the relativistic Dirac 
theory did not completely describe the H atom. They used the methods of high-fre- 
quency and microwave spectroscopy to observe very small energy shifts and splitting in 
the spectrum of atomic hydrogen. In other words, they used the absorption by H atoms 
of electromagnetic radiation from high-frequency transmitters or klystron tubes. They 
could, in this way, observe energy differences between terms with the same y, namely 
differences of 0.03 cm~^ - this corresponds to a difference of 900 MHz - between the 
terms and 2 ^Pi/ 2 . 
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Fig. 12.21. The Lamb shift: fine structure of 
the « = 2 level in the H atom according to 
Bohr, Dirac and quantum electrodynamics 
taking into account the Lamb shift. The j de- 
generacy is lifted 



They achieved a precision of 0.2 MHz. Figure 12.21 shows the corresponding 
energy diagram. 

Like the fine structure, this small energy shift was not observable by means of 
optical spectroscopy as a splitting of the line of hydrogen, because the Doppler 
broadening of the spectral lines due to the motion of the atoms exceeded the magnitude 
of the splitting. 

The Lamb-Retherford result can be generalised: levels with the same quantum 
numbers n and y, but different /, are not exactly the same. Rather, all S 1/2 terms are 
higher than the corresponding P 1/2 terms by an amount equal to about 10% of the 
energy difference (Psn-Pm)^ and the Pyi terms are higher than the D ^/2 terms by 
about 2%o of (£> 5 / 2 -Z? 3 / 2 ). 




Fig. 12.22. Arrangement for measurement of the Lamb shift. A beam of H atoms is excited to the metastable 
2S^/2 state by bombardment with electrons. The beam passes through a resonator. If electromagnetic transi- 
tions are induced there, the number of excited atoms reaching the tungsten foil receiver is lower, and the 
measured electron current correspondingly drops. The magnetic field B serves to create an additional 
energetic separation between the 5^/2 and the P ^/2 states. This prevents mixing of those states and thereby 
avoids an immediate decay via the 2 P ^/2 state 



The Lamb and Retherford experiment is shown in Fig. 12.22. A beam of hydrogen 
atoms is generated from H 2 molecules by thermal dissociation at 2500 °C. A small 
number of these atoms is excited to the metastable state by bombardment by 
electrons. Optical transitions between this state and the ground state l^Si /2 forbid- 
den. The atoms then pass through a tunable resonator for high-frequency or micro- 
wave radiation, to a tungsten foil. There the metastable atoms can give up their excita- 
tion energy, thereby releasing electrons from the surface of the metal. The electron 
current is measured and serves as an indicator of the rate at which atoms in the 2 ^S \/2 
state arrive at the detector. Those atoms which are excited to the 2^P3/2 state by absorb- 
ing microwaves in the range of 10000 MHz in the resonator (compare term scheme in 
Fig. 12.21) can emit light at the wavelength of the H^^ line (or more exactly, of one 
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Fig. 12.23. Structure of the line of hydrogen and term scheme 
including the fine structure. The expected optical spectrum is 
shown below (ignoring the line widths). Darker lines indicate higher 
intensity. The wavenumbers are in cm“^ 



Fig. 12.24. Above: Structure of line of the hydrogen atom at 
room temperature. The linewidth and thus the spectral resolu- 
tion is determined by the Doppler width. Below: The method of 
Doppler-free spectroscopy (saturated absorption using a dye laser, 
Sect. 22.3) allows resolution of the individual components of the 
line (after Hdnsch et al.). The two additional very weak lines 
shown in Fig. 12.23 are omitted in Fig. 12.24 



component of this line) and return to the ground state. When absorption of this type 
occurs, the electron current in the tungsten foil decreases. Lamb and Retherford found 
in 1947 that the same effect, a decrease in the electron current, occurred on absorption 
or induced emission of radiation at a frequency of about 1000 MHz. This was due to 
the transition from the to the 2^P^/i state. From the latter state, radiative tran- 

sitions to the ground state are also allowed. It was thus shown that even states with 
the same total angular momentum j are energetically different. 

The term scheme of an atom can be refined for optical transitions as well. Figure 
12.23 shows the complete term scheme for the line of the hydrogen atom. This line 




Fig. 12.25. Lamb shift and fine struc- 
ture of the helium atom: fine structure 
at 1640 A. Seven components are ob- 
served. The lines 1,2 and 3,4 would be 
unsplit without the Lamb shift. [From 
G. Herzberg: Trans. Roy. Soc. Can. 5, 
(1967) Fig. 5] 
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consists of 7 components of different intensities in an energy range of about 0.4 cm"^ 
The upper part of Fig. 12.24 shows the structure of the line, as it can be observed in 
the presence of Doppler broadening. The lower part shows a curve obtained by the 
modern method of Doppler-free spectroscopy (Sect. 22.3). With this method, the 
Lamb shift can also be seen in optical spectra. The fine structure splitting and the Lamb 
shift are larger in the heavier helium atom, so that direct optical observation of the 
splitting is easier than with the H atom. Even without removing the Doppler broaden- 
ing, the fine structure can be resolved, as Fig. 12.25 shows. 

The Lamb shift was of utmost importance for the development of quantum electro- 
dynamics. Previously, this theory treated only the emission and absorption of light 
quanta in atomic transitions. To explain the Lamb shift, it was necessary to go one step 
further. It had to be assumed that the electrons in an atom were continually emitting 
and reabsorbing light quanta, in which process energy conservation can apparently be 
violated. 

This “violation of energy conservation” must not, to be sure, be taken too literally. 
According to the uncertainty relation between energy and time (7.29), the energy is only 
sharply defined when a measurement is performed over a sufficiently long period of 
time. It is thus completely consistent with energy conservation that an electron can emit 
a quantum even without having the necessary energy, as long as the quantum is reab- 
sorbed quickly enough. Much more decisive for the theoreticians was, however, the 
recognition that the energy shifts in the atomic levels (on a negative energy scale) pro- 
duced by these “virtual” processes were infinitely large. A free electron can also con- 
tinually emit and absorb virtual quanta; its energy decreases infinitely in the process. 
Energy shifts caused by virtual processes are tQxmQd self energy. Experimentally, a free 
electron, like a bound electron, has a well-defined, finite energy. The basic idea for 
solving the “infinity problem” of the energy shift was the recognition that only the dif- 
ference between the energies of bound and free electrons is physically interesting. Or, in 
other words: to calculate the energy shift of bound electrons, one must subtract the 
self-energy of a free electron from that of a bound electron in a particular atomic state 
(cum grano salis). This process is termed “renormalisation”. Since the masses also 
become infinite due to virtual processes, they must also be “renormalised”. Naturally, 
at first glance it seems very adventurous to subtract two infinite quantities from one 
another in order to obtain a well-defined finite result. In the framework of quantum 
electrodynamics, however, it was found possible to set up well-defined rules for the 
renormalisation procedure, and the Lamb shift can be calculated today with great 
precision. The important result is that the validity of quantum electrodynamics can 
therefore be tested - and has been verified - in an excellent manner. 

A summary of the theoretical treatment is given in Sect. 15.5.2. In preparation for 
this treatment, in Sect. 15.5.1 we introduce the quantisation of the electromagnetic 
field, which follows immediately from the quantisation of the harmonic oscillator. As 
is shown in one of the problems for Sect. 15.5.1 , the theory of the Lamb shift has a sur- 
prisingly simple physical explanation: the quantum-mechanical zero-point fluctuations 
of the electromagnetic field act statistically on the electrons and thus cause a shift of 
their potential energy. 
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Problems 



12.1 Calculate the precessional frequency of electrons and of protons [/= 1/2, mag- 
netic moment = (2.79/1836 ) -^b] in the magnetic field of the earth (- 2- 10"^ tesla). 

12.2 In the Stern-Gerlach experiment, a beam of silver atoms in the ground state 
(5 ^Si/ 2) is directed perpendicularly to a strong inhomogeneous magnetic field. The field 
gradient is dB/dz = 10^ tesla/m. In the direction of the atomic beam, the magnetic 
field extends a distance of /i = 4 cm, and the catcher screen is a distance /2= 10 cm 
from the magnet. Calculate the components of the magnetic moment in the direction of 
the magnetic field, if the splitting of the beam at the screen is observed to be = 2 mm, 
and the velocity of the atoms is v = 500 m/s. The average mass of silver atoms is 
M= 1.79 • 10"^^ kg. Why doesn’t the nuclear spin affect the experiment? 

12.3 How large is the magnetic field generated by the electron in the ground state of a 
hydrogen atom, at the position of the proton if it would circulate according to Bohrs 
model on the shell n = 11 



12.4 How large is the magnetic moment of the orbital motion in a muonium atom, in 
which the electron of a ground-state hydrogen atom has been replaced by a muon? 
How large is the moment in positronium (an electron and a positron, i.e. particles with 
the mass of the electron and opposite charges, moving around the common centre of 
mass)? 



12.5 Calculate the spin-orbit splitting of the states of the hydrogen atom with n = 2 
and « = 3 using the relations 



Tz _1 Ze Mo 

^l,s ~ 2 

n Snmnr 



and 



What are the values for a Rydberg state with n = 30 for the largest (/ = 1) and the 
smallest (/ = 29) splitting? 

Oq is the radius of the innermost Bohr orbit. 

12.6 In the cesium atom, spin-orbit splitting between the states 6 P 1/2 and 6P3/2 leads 
to a wavelength difference of AX = 422 A for the first line pair of the primary series, 
with A = 8521 A for the line with the shorter wavelength. Calculate from this the fine 
structure constant a and the field at the nucleus Bj. Use (12.27). 

12.7 Sketch the energy levels of the hydrogen atom, including the fine structure, up to 
n = 3. Show the possible transitions. How many different lines are there? 



12.8 The fine structure in hydrogen-like ions (ions with only one electron) is described 
by (12.33). 




Problems 
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a) Show that the correction term does not disappear for any possible combination of 
the quantum numbers n and y, but that it always reduces the value of the uncor- 
rected energy. 

b) Into how many energy levels are the terms of singly charged helium with the 
principal quantum numbers « = 3 and n = A split by the fine structure interaction? 

c) Sketch the positions of these levels relative to the non-shifted terms and give the 
amount of the shift. 

d) Determine which transitions are allowed, using the selection rules Al= ±l,Zly = 0 
or ±1. 

12.9 Give the relative splitting of the various levels of an LS/ multiplet due to spin- 

orbit interaction for the ^Fand multiplets. Sketch the energy levels of these multi- 
plets and indicate with arrows the allowed transitions. Repeat the above process 

for the and transitions. 

12.10 The interaction energy E between two magnetic moments /ij and /12 is (r = the 
radius vector of /ii and 

^ //Q \ fix' fii 
47t 

a) Under which conditions is F = 0 for a given |r | ? 

b) For parallel moments, which arrangement yields an extreme value for F? 

c) For case b) with |/* | = 2 A, calculate the energy for the electron-electron and proton- 
proton interactions. In each case, how large is the magnetic field at fi 2 due to fi\ 

(A^proton=1.4-10-2^Am2)? 





13. Atoms in a Magnetic Field: 

Experiments and Their Semiclassical Description 



13.1 Directional Quantisation in a Magnetic Field 

In the previous chapters, we have already seen that a directional quantisation exists. 
The angular momentum vectors in an atom can only orient themselves in certain 
discrete directions relative to a particular axis (the quantisation axis). The directional 
quantisation is described by the magnetic quantum number m. In an applied magnetic 
field Bq, the interaction energy between the field and the magnetic moment of the 
electrons in an atom, which we have already calculated, leads to a splitting of the 
energy terms, which is described by the different possible values of the magnetic 
quantum number. We shall concern ourselves in this chapter with the measurement of 
this energy splitting. 

A first application of the splitting of atomic states in a magnetic field to the deter- 
mination of the magnetic moments of the atoms was already discussed in the treatment 
of the Stern-Gerlach experiment. In the following, we shall consider some other types 
of experiments. 



13.2 Electron Spin Resonance 



The method of electron spin resonance (abbreviated ESR, sometimes EPR for electron 
paramagnetic resonance) involves the production of transitions between energy states 
of the electrons which are characterised by different values of the magnetic quantum 
number m. In general, the degeneracy is lifted by the application of an external 
magnetic field; the transition frequencies, which are usually in the range of microwave 
frequencies, depend on the strength of the applied field. With this technique, one can 
observe transitions between states of different magnetic quantum number directly. In 
Zeeman spectroscopy, to be described later, the transitions observed are in the optical 
region, and their response to magnetic fields is studied; in this case, the transitions 
cause changes in not only the magnetic quantum number, but also in the other 
quantum numbers. 




Fig. 13.1. The spin of an electron, and thus its magnetic moment, have two 
possible orientations in an applied magnetic field. They correspond to two 
values of the potential energy 
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The principle of ESR may be easily understood by considering the magnetic 
moment produced by the spin of a free electron in a magnetic field Bq (Fig. 13.1). 

An electron has the magnetic moment 



;Ms= 1 / 5 ( 5 + 1)//b0S (13.1) 

with the possible components along the quantisation axis z of the field Bq 

C«s)z= (13.2) 

The potential energy of these two orientations differs by the amount 



AE = gsiiBBo^ (13.3) 

If a sinusoidally varying magnetic field = Bi sin cot is now applied in a direction 
perpendicular to transitions between the two states are induced if the frequency 
V = co/2 71 fulfils the condition 



AE = hv = 



(13.4) 



or, in numbers, 

v = 2.8026 • 10‘® -5o Hz (tesla) . (13.5) 

The transitions with = ± 1 are allowed magnetic dipole transitions. A quantum 
mechanical treatment of ESR will follow in Chap. 14. The frequency which must be 
used depends, according to (13.5), on the choice of the applied magnetic field Bq, For 
reasons of sensitivity, usually the highest possible frequencies are used, corresponding 
to the highest possible magnetic fields. The fields and frequencies used in practice are, 
of course, limited by questions of technical feasibility; usually, fields in the range 0.1 to 
1 T are chosen (T = tesla). This leads to frequencies in the GHz region (centimetre 
waves). 

What we have here described for a free electron is also valid for a free paramagnetic 
atom. In this case, the total resultant magnetic moment produced by the spin and or- 
bital angular momenta of the atom, /ly, must be used in (13.3-5). 




Fig. 13.2. Demonstration experiment for electron spin resonance: a 
gyroscope whose axle is a bar magnet is precessing in a magnetic 
field Bq (as well as in the gravitational field of the earth). The in- 
clination of the axis of the gyroscope relative to Bq may be changed 
by means of an oscillating field B^ if the frequency of B^ is equal to 
the precession frequency of the gyroscope. For a lecture demonstra- 
tion, it is expedient to construct the gyroscope in such a way that it 
is driven from the support pedestal S, for example using com- 
pressed air and following the principle of a water turbine 




13.2 Electron Spin Resonance 



207 



The fundamental idea of ESR may be illustrated by a mechanical model (Fig. 13.2): 
a gyroscope containing a bar magnet in its axis is precessing in a magnetic field. The 
precession frequency is (neglecting gravitational force) 



(Ol = 



IT I 



(13.6) 



where // is the magnetic moment of the bar magnet and L is the angular momentum 
of the gyroscope. 

The precession frequency or better, angular velocity coi of a magnetic gyroscope in 
a magnetic field is independent of the angle a between // and Bq, since the torque pro- 
duced by the field and the rate of change of the angular momentum vector both depend 
in the same way on the sine of the angle a (12.12). When gravitational force is neglected, 
the frequency o>l is determined only by the magnetic moment ju and the angular 
momentum L of the gyroscope, as well as by the torque produced by the field Bq. 

When we now let an additional oscillating field B^ with the frequency co act perpen- 
dicular to Bq, we observe a continuous increase or decrease in the angle of inclination 
a, depending on whether the field is in phase or out of phase with the motion of the 
gyroscope, provided that the frequency oj is equal to a>i. 

This model may be immediately transferred to the atom. We replace the magnetic 
moment of the bar magnet by the moment of the atom and obtain for the circular fre- 
quency of the electron spin resonance the following condition: 



(13.7) 

This is the Larmor frequency, which was already introduced in Sect. 12.3. 

In the classical gyroscope model, the tip of the gyroscope axle moves on a spiral 
orbit from one stable position to another. This picture may be applied with consider- 
able accuracy to the motion of the spin or the orbital angular momentum in an atom. 
There is an additional possibility for picturing the resonant transitions, which makes 
use of the fact that the spin or the angular momentum of an atom has only certain 
discrete allowed stationary orientations in a constant magnetic field ^o- Iri this picture, 
the spin makes transitions between these discrete energy levels under the influence of 
the oscillating field B^. In particular, this means in the case of spin 1/2 that the spin 
flips from the one possible orientation to the other when the resonance condition (13.7) 
is fulfilled. 

Electron spin resonance was observed for the first time in 1944 by the Russian phy- 
sicist Zavoisky. The analogous spin resonance of paramagnetic atomic nuclei is seen 
under otherwise identical conditions at a frequency which is 3 orders of magnitude 
smaller, due to the fact that nuclear moments are about a factor of 1000 smaller than 
atomic magnetic moments; the corresponding frequencies are in the radio frequency 
region. This nuclear magnetic resonance (NMR) was observed in the solid state for the 
first time in 1946 by Bloch and Purcell, nearly 10 years after it had first been used by 
Rabi to measure the gyromagnetic ratio of nuclei in gas atoms (cf. Sect. 20.6). 

A schematic of an ESR apparatus is shown in Fig. 13.3. Today, ESR spectrometers 
count as standard spectroscopic accessories in many physical and chemical labora- 
tories. For technical reasons, usually a fixed frequency is used in the spectrometers; 
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Fig. 13.3. Electron spin resonance. Above: Schematic 
representation of the experimental setup. The sample 
is located in a resonant cavity between the pole pieces 
of an electromagnet. The microwaves are generated by 
a klystron and detected by a diode. To increase the 
sensitivity of detection, the field Bq is modulated. Be- 
low, left: Energy states of a free electron as functions 
of the applied magnetic field. Below, right: Signal U 
from the diode as a function of Bq for resonance 



the magnetic field is varied to fulfil the resonance condition and obtain ESR transitions 
in absorption and sometimes in emission. The sample is usually placed in a microwave 
resonator; a frequently used wavelength is 3 cm (the so-called X-band). The microwave 
radiation is generated by a klystron and detected by a high frequency diode or a bolo- 
meter. 

ESR is utilised for 

- precision determinations of the gyromagnetic ratio and the g factor of the electron; 

- measurement of the g factor of atoms in the ground state and in excited states for 
the purpose of analysing the term diagram; 

- the study of various kinds of paramagnetic states and centres in solid state physics 
and in chemistry: molecular radicals, conduction electrons, paramagnetic ions in 
ionic and metallic crystals, colour centres. 

The full importance of ESR will only become clear after we have treated the topic of 
hyperfine structure, i.e. when we discuss the interaction of the electronic spin with the 
spins of the neighbouring nuclei. Using this interaction, termed hyperfine splitting 
(Chap. 20), one can determine the spatial distribution of the electrons in molecules, in 
liquids, and in solids. More information about ESR and its significance for the physics 
of atoms, molecules, and solids is given in H. Haken and H.C. Wolf: Molecular Phys- 
ics and Elements of Quantum Chemistry, Chap. 19. 



13.3 The Zeeman Effect 

13.3.1 Experiments 

The splitting of the energy terms of atoms in a magnetic field can also be observed as a 
splitting of the frequencies of transitions in the optical spectra (or as a shift). A splitting 
of this type of spectral lines in a magnetic field was observed for the first time in 1896 
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Fig. 13.4. Demonstration experiment for the Zeeman effect. A flame coloured 
with sodium or NaCl appears dark when projected using light from a Na 
vapour lamp. Upon switching on a magnetic field, it brightens, since the reso- 
nance between the light from the lamp and the light of the sodium flame is de- 
stroyed by the Zeeman effect. The wavelength of the light from the flame is 
shifted slightly by the magnetic field; this suffices to remove the resonance 



by Zeeman. The effect is small; for its observation, spectral apparatus of very high re- 
solution is required. These are either diffraction grating spectrometers with long focal 
lengths and a large number of lines per cm in the grating, or else interference spectro- 
meters, mainly Fabry-Perot interferometers. We shall discuss this topic in more detail 
in Chap. 22. 

There is, however, a simple lecture demonstration (Fig. 13.4) which shows the shift 
of the spectral lines in a magnetic field in a drastic manner: a flame, coloured yellow 
with sodium, is opaque to the yellow light of a sodium vapour lamp, because the latter 
represents resonance light, i.e. light whose wavelength matches the absorption and 
emission wavelength in the flame. If, however, a magnetic field is applied to the flame, 
the resonance between the light source (Na lamp) and the absorber (Na flame) is de- 
stroyed. On the observation screen, the previously “dark” flame brightens, because it 
has now become transparent to the light from the Na vapour lamp. 

With a Fabry-Perot interferometer or with a grating spectrometer of sufficient 
resolution, the splitting in magnetic fields may be quantitatively measured. The split- 
ting behaviour observed in moderate magnetic fields is illustrated in Figs. 13.5 and 
13.6. The splitting of the cadmium line in Fig. 13.5 is called the “ordinary” Zeeman 
effect; using transverse observation (i.e. observation perpendicular to the direction of 
the applied magnetic field. Fig. 13.7), one sees the unshifted line as well as two 
symmetrically split components, each linearly polarised. With longitudinal observation 
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Fig. 13.6. Anomalous Zeeman effect, here 
using the sodium D lines. The line splits 
into four components, the D 2 line into six in 
a magnetic field. The wavelengths of the D| 
and D 2 lines are 5896 and 5889 A; the quan- 
tum energy increases to the right in the dia- 
gram 
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◄ Fig. 13.5. Ordinary Zeeman effect, e.g. for the atomic Cd line at A = 6438 A. With 
transverse observation the original line and two symmetrically shifted components are 
seen. Under longitudinal observation, only the split components are seen. The polar- 
isation {E vector) is indicated 
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(parallel to the field lines), only the two shifted components are seen; they are circularly 
polarised in this case. 

The splitting behaviour of the D lines of the sodium atom shown in Fig. 13.6 is 
typical of the anomalous Zeeman effect. The number of components into which the 
spectral lines are split is greater than in the normal Zeeman effect. Both the ordinary 
and the anomalous Zeeman effects merge to the so-called Paschen-Back effect in suf- 
ficiently large magnetic fields Bq, We shall now discuss these three effects of the 
influence of magnetic fields on the spectral lines and the energy terms of atoms. 




s 

Longitudinal 
observation 



Transverse 

observation 

I 



Fig. 13.7. Transverse and longitudinal observation of spectral lines 
in a magnetic field. The three component electrons used in the clas- 
sical description of the Zeeman effect are indicated (orbits with 
arrows in the pole gap of the magnet). The emission of a light 
source in the magnetic field is observed either transversely or longi- 
tudinally (through a hole drilled in the magnet pole piece). S is the 
entrance slit of a spectrometer 



13.3.2 Explanation of the Zeeman Effect from the Standpoint 
of Classical Electron Theory 

The Zeeman effect may be understood to a large extent using classical electron theory, 
as was shown by Lorentz shortly after its discovery. We shall restrict ourselves to the 
ordinary Zeeman effect - the splitting of states with pure orbital angular momentum. 
If the resultant angular momentum is composed of both spin and orbital contributions, 
one speaks of the anomalous Zeeman effect. The normal Zeeman effect describes states 
in which no spin magnetism occurs. In these states, at least two electrons contribute 
in such a way that their spins are coupled to zero. Therefore, the normal Zeeman effect 
is found only for states involving several (at least two) electrons, which are treated in 
Chap. 17. 

We discuss the emission of light by an electron whose motion about the nucleus is 
interpreted as an oscillation, for example by considering the projection of the motion 
on a certain direction. We ask the question, “What force does a magnetic field exert on 
a radiating electron?” The radiating electron is treated as a linear oscillator with a 
random orientation with respect to the magnetic lines of force (Fig. 13.8). 

In the model, we replace the electron by three component oscillators according to 
the rules of vector addition: component oscillator 1 oscillates linearly, parallel to the 
direction of Bq, Oscillators 2 and 3 oscillate circularly in opposite senses and in a plane 




Bo 



Fig. 13.8. Classical explanation of the Zee- 
man effect. An oscillating electron is re- 
solved into three component oscillators. 
Further details in the text 
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perpendicular to the direction of Bq, This resolution into components is allowed, since 
any linear oscillation may be represented by the addition of two counterrotating 
circular ones. Without the field Bq, the frequency of all the component oscillators is 
equal to that of the original electron, namely cuq. 

We now inquire as to the forces which the magnetic field exerts on our three com- 
ponent electron-oscillators: 

- Component 1, parallel to Bq, experiences no force. Its frequency remains un- 
changed; it emits light which is linearly polarised with its E vector parallel to the 
vector ^ 0 - 

- The circularly oscillating components 2 and 3 are accelerated or slowed down by the 
effect of magnetic induction on turning on the field Bq, depending on their direc- 
tions of motion. Their circular frequencies are increased or decreased by an amount 

d(D = {(e/mo)Bo = (//b/^)^o • (13.8) 

This is almost the same expression as that which we have already come to know as 
the Larmor frequency. It differs from the Larmor frequency only by a factor 2, 
because we are here dealing with an orbital moment (^ = 1) instead of a spin moment 
(g = 2) as in the case of the Larmor frequency, which applies to electron spin 
resonance. 

Classically, one can calculate the frequency shift dcu for the component oscillators 
as follows: without the applied magnetic field, the circular frequency of the component 
electrons is coq. The Coulomb force and the centrifugal force are in balance, i.e. 

2 

mojQr = 

In a homogeneous magnetic field Bq applied in the z direction, the Lorentz force acts in 
addition; in Cartesian coordinates, the following equations of motion are then valid: 



mx+mo)lx-eyBo = 0 , 


(13.9a) 


my + m(oly + exBo -- 0 , 


(13.9b) 


mz+ma>oZ =0. 


(13.9c) 



From (13.9c), we immediately find the solution for component oscillator 1, z = 
Zoexp(icuo0> i-C- the frequency of the electron which is oscillating in the z direction 
remains unchanged. 

To solve (13.9a) and (13.9b), we substitute u = x-\-\y and v=x-iy. It is easy 
to show that the equations have the following solutions (with the condition 
eBo/2m < coq): 

u = UQQxp[\(o)o-eBo/2m)t] and v = VoQxp[i(a>o + eBo/2m)t] . 

These are the equations of motion for a left-hand and a right-hand circular motion with 
the frequencies a>o±dco, with Sa> = eBo/2m. The component electron oscillators 2 
and 3 thus emit or absorb circularly polarised light with the frequency 

The splitting observed in the ordinary Zeeman effect is therefore correctly predicted 
in a classical model. 
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z<Boj 




Fig. 13.9. Precession of J and 
^ j about the direction of the 
applied field Bq. ordinary Zee- 
man effect, i.e., J = L 



The frequency change has the magnitude: 

Sv=Soj/2n = — —Bo. (13.10) 

4n rtio 

For a magnetic field strength Bq=\T, this yields the value 

Jv= 1.4- 10‘®s“* ^0.465 cm“‘ . (13.11) 

Independently of the frequency v, we obtain the same frequency shift for each 
spectral line with a given magnetic field Bq. Theory and experiment agree completely 
here. For the polarisation of the Zeeman components, we find the following predic- 
tions: component electron oscillator 1 has the radiation characteristics of a Hertzian 
dipole oscillator, oscillating in a direction parallel to Bq, In particular, the E vector of 
the emitted radiation oscillates parallel to Bq, and the intensity of the radiation is zero 
in the direction of Bq, This corresponds exactly to the experimental results for the un- 
shifted Zeeman component; it is also called the n component {n for parallel). If the 
radiation from the component electron oscillators 2 and 3 is observed in the direction 
of Bq, it is found to be circularly polarised; observed in the direction perpendicular to 
Bq, it is linearly polarised. This is also in agreement with the results of experiment. This 
radiation is called and o~ light, were a stands for perpendicular (German “senk- 
recht”) and the + and - signs for an increase or a decrease of the frequency. The 
light is right-circular polarised, the o~ light is left-circular polarised. The direction is 
defined relative to the lines of force of the Bq field, not relative to the propagation 
direction of the light. 

The differing polarisations of the Zeeman components are used in optical pumping. 
In this technique, the exciting light can be polarised so as to populate individual 
Zeeman levels selectively, and thus to produce a spin orientation. More about this in 
Sect. 13.5. 



13.3.3 Description of the Ordinary Zeeman Effect by the Vector Model 

In the preceding section, we gave a purely classical treatment of the ordinary Zeeman 
effect; we now take the first step towards a quantum mechanical description. For this 
pupose, we employ the vector model which has been already introduced in Sect. 12.2 
(cf. Fig. 13.9 and 13.11. Note that this figure illustrates the somewhat more complex 
case in which both orbital and spin magnetism play a role). A complete quantum 
mechanical treatment will be given in Chap. 14. The angular momentum vector j and 
the magnetic moment //y, which is coupled to j, precess together around the field axis 
Bq. The additional energy of the atom due to the magnetic field is then (Chap. 12 and 
Fig. 13.9 and 13.11) 

Vntj = = -^mjgjjUBBo with mj=j, j- 1, . . . -y . (13.12) 

Here the factor gi in (12.10) was replaced by gy, because the total angular momen- 
tum is being considered. 

The (2y-i- l)-fold directional degeneracy is thus lifted, and the term is split into 
2y-l- 1 components. These are energetically equidistant. The distance between two com- 
ponents with Amj= i is 

AE = gjiu^Bo . 
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Fig. 13.10. Ordinary Zeeman effect. Splitting of the A = 
6438 A line of the neutral Cd atom, transition 
into three components. The transitions with Arrij=0 
are called n transitions; those with Amj= ±1 are a 
transitions. The quantum number J is written as a 
capital letter because the atom has several electrons (see 
Chap. 17). Here, 5 = 0, and / = T : we are dealing with 
a case of purely orbital magnetism 



If we ignore the spin and consider only orbital magnetism (i.e. the ordinary Zeeman 
effect), Qj has a numerical value of 1 and we obtain 

8v = — —Bq. (13.13) 

An mo 

The magnitude of the splitting is thus the same as in classical theory. For optical 
transitions, one must also make use of the selection rule 

Amj = 0, ±1 . 

One thus obtains from quantum theory, too, the result that the number of lines is 
always three: the ordinary Zeeman triplet. 

As an example. Fig. 13.10 shows the splitting diagram for a cadmium line. We must 
point out that the orbital angular momentum for the states of the Cd atom comprises 
the orbital angular momenta of two electrons, and is therefore indicated by a capital 
letter L. The spins of the two electrons are antiparallel and thus compensate each other, 
giving a total spin S = 0. Transitions between the components of different terms (e.g. 

or ^£>2 in Fig. 13.10) with the same Amj are energetically the same. The splitting is 
equal in each case because only orbital magnetism is involved. [See the discussion of the 
Lande g factor in Sect. 13.3.5, especially (13.18).] The undisplaced line corresponds to 
transitions with Am = 0, while the displaced lines are the transitions with Am = ±1. 
They are circularly polarised. 

Polarisation and ordinary Zeeman splitting are a good example of the correspon- 
dence principle (Sect. 8.11). Based on the conservation of angular momentum for the 
system of electrons and light quanta, the polarisation behaviour of the Zeeman effect 
implies that light quanta have the angular momentum 1 • h. 
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Fig. 13.11. Left: The relation between the angular momentum /, the magnetic moment Hj and their orienta- 
tion with respect to the magnetic field Bq for strong spin-orbit coupling, cf. also Fig. 13.13. The angular 
momentum vectors S and L combine to form J. Likewise, the associated magnetic moments and com- 
bine to fij. Because spin and orbital magnetism have different gyromagnetic ratios, the directions of the 
vectors J and do not coincide. What can be observed is the projection of fij on /, as the time average 
of many precession cycles. That is, one observes the component {Mj)j, which is therefore represented as fij 
or see the right-hand diagram. In the one-electron system, lower case letters can be used instead of 5, 
L and /, as is done in the text. Right: The projection of fij on the vector / is see Fig. 13.14. The pro- 
jection of {fij)j on ^0 is calculated using the Lande factor. Because the angular momenta S and L are 
strongly coupled, the vector jj. j precesses rapidly around the negative extension of the vector /. Only the 
time average the / direction can be observed. This precesses slowly, because of weak coupling, 

around the axis of Bq. The magnetic energy is the product of the field strength Bq and the component of 
in the direction of Bq, i.e. {Rj)j^z Lower case letters can be used instead of 5, L, / in the 

one electron system. 

Figure 13.11 illustrates the anomalous Zeeman effect (Sect. 13.3.4). The ordinary Zeeman effect 
(Sect. 13.3.3) is more simple. From 5 = 0 follows //y = and the directions of the vectors - Hj and J = L 
coincide; see Fig. 13.9 

13.3.4 The Anomalous Zeeman Effect 

One speaks of the anomalous Zeeman effect when the angular momentum and mag- 
netic moment of the two terms between which an optical transition occurs cannot be 
described by just one of the two quantum numbers 5 or / (or S or L), but are determined 
by both. Refer also to Fig. 13.11. This is the general case, in which atomic magnetism 
is due to the superposition of spin and orbital magnetism. The term “anomalous” Zee- 
man effect is historical, and is actually contradictory, because this is the normal case. 

In cases of the anomalous Zeeman effect, the two terms involved in the optical tran- 
sition have different g factors, because the relative contributions of spin and orbital 
magnetism to the two states are different. The g factors are determined by the total 
angular momentum / and are therefore called gj factors. The splitting of the terms in 
the ground and excited states is therefore different, in contrast to the situation in the 
normal Zeeman effect. This produces a larger number of spectral lines. The calculation 
of the gj factors follows in Sect. 13.3.5. 

We will use the Na D lines (Fig. 13.12) as an example for a discussion of the 
anomalous Zeeman effect. 

For the three terms involved in the transitions which produce the Na D line, namely 
the ^Si/ 2 , the ^P \/2 and the the magnetic moments in the direction of the field are 

= , (13.14) 
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and the magnetic energy is 



Vmj= -ijUjhzBo- (13.15) 

The number of splitting components in the field is given by mj and is again 2y + 1. 
The distance between the components with different values of ruj - the so-called 
Zeeman components - is no longer the same for all terms, but depends on the quan- 
tum numbers /, 5, and j: 



~ (13.16) 

Experimentally, it is found that gj = 2 for the ground state ^Si/ 2 , 2/3 for the state 
'^P\/2 and 4/3 for the state ^^3/2- We shall explain these gj factors in the next section. For 
optical transitions, the selection rule is again = 0, ± 1. It yields the 10 lines shown 
in Fig. 13.12. The spectrum which is, in fact, observed is shown schematically in 
Fig. 13.13. 
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Fig. 13.12. Anomalous Zeeman effect. Splitting of the and D 2 
lines of the neutral Na atom, transitions ^5^/2 -^Pi /2 and 
^*^ 1 / 2 “ ^^ 3 / 2 ’ 4 and 6 components, respectively, in a magnetic 

field. Here, S=0 and we are dealing with purely orbital magnet- 
ism. The ^^ 3/2 state is higher in energy than the "Z’l/2 State; this is 
not shown in the figure. Compare also Fig. 12.18 
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Fig. 13.13. Energy splitting (in cm b of the D, and D 2 lines in a 
magnetic field of 3 T (Zeeman effect) 



The significance of the Zeeman effect is primarily its contribution to empirical term 
analysis. Term splitting depends unequivocally on the quantum numbers /, s and j or, 
in many-electron atoms, L, S and / (Chap. 17). The quantum numbers can therefore be 
determined empirically from measurements of the Zeeman effect. 



13.3.5 Magnetic Moments with Spin-Orbit Coupling 

In anomalous Zeeman splitting, other values of gj than 1 (orbital magnetism) or 2 (spin 
magnetism) are found. We can understand these quantitatively through the vector 
model. 

The gj factor links the magnitude of the magnetic moment of an atom to its total 
angular momentum. The magnetic moment is the vector sum of the orbital and spin 
magnetic moments. 
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The directions of the vectors jUi and I are antiparallel, as are those of the vectors jn^ 
and s. In contrast, the directions of j and - jAj do not in general coincide. This is a result 
of the difference in the g factors for spin and orbital magnetism. This is demonstrated 
in Figs. 13.14 and 13.11. 

The magnetic moment /ly resulting from vector addition of /// and in^ precesses 
around the total angular momentum vector y, the direction of which is fixed in space. 
Due to the strong coupling of the angular momenta, the precession is rapid. Therefore 
only the time average of its projection on j can be observed, since the other components 
cancel each other in time. This projection precesses in turn around the Bq axis of 
the applied magnetic field Bq. In the calculation of the magnetic contribution to the 
energy the projection of juj on the j axis (jHj)j must therefore be inserted in (13.15). 
Its magnitude can be calculated from the vector model: from Figs. 13.11 and 13.14, 
the j component of fij is 

\(jlj)j\ = \fii\cosil,j) + ||/s|cos(5,7) 

= MbI]/10+^) cos(/j) + 2|/5(5+1) cos(sJ)] . 



s 




Fig. 13 . 14 . Calculation of the J components of jUj and interpreta- 
tion of the differing g factors of orbital and spin magnetism. 
Again, lower case letters s, I and j apply to single-electron systems, 
upper case 5, L and J to many-electron systems 



The expressions for cos(/,y) and cos(s,j) are derived from Figs.13.14 and 13.11 
using the law of cosines. The length of the vectors is again ]/l(l+l)h or ]/5(5'+ 1)^, 
respectively. We shall present a deeper quantum theoretical justification for this in 
Sect. 14.3. 

We then have for the magnitude of (jUj)j, 






3j(j + 1) + 5(5 + 1) — /(/+ 1) 

21/7(7+1) 



y«B = 0yl//(/+l)y«B. 



(13.17) 



and for the moment 






with 



- 1 + 



7(/+l) + 5(5+ !)-/(/+!) 



2y (/•+!) 



(13.18) 
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and for the component in the z direction, 

(i“y)y,z= (13.19) 

The Lande factor qj defined in this way has a numerical value of 1 for pure orbital 
magnetism (s = 0) and 2 (more exactly, 2.0023) for pure spin magnetism (/= 0). For 
mixed magnetism, one observes values which differ from these two cases. By making 
the appropriate substitutions, one can easily see that the g factors given in the preceding 
section for the terms of the sodium atom are obtained from (13.18). In many-electron 
atoms, the quantum numbers 5, / and j are replaced by S, L and /, as already mentioned 
(but see Sect. 17.3.3). This has been done in Figs. 13.1 1 and 13.14. 



13.4 The Paschen-Back Effect 



The preceding considerations on the splitting of spectral lines in a magnetic field hold 
for “weak” magnetic fields. “Weak” means that the splitting of energy levels in the 
magnetic field is small compared to fine structure splitting; or, in other words, the 
coupling between the orbital and spin moments, the so-called spin-orbit coupling, is 
stronger than the coupling of either the spin or the orbital moment alone to the external 
magnetic field. Since spin-orbit coupling increases rapidly with increasing nuclear 
charge Z (Sect. 12.8), the conditions for a “strong” field are met at a much lower field 
with light atoms than with heavy atoms. For example, the spin-orbit splitting of the 
sodium D lines is 17.2 cm~\ while the splitting for the corresponding lines of the 
lithium atom is 0.3 cm~\ The Zeeman splitting in an external field Bq of 30 kG (3 T) is 
the same in both cases, about 1 cm“^ cf. Fig. 13.13. Thus this field is a “strong” 
magnetic field for lithium, but a “weak” field for sodium. 

When the magnetic field is strong enough so that the above condition is no 
longer fulfilled, the splitting picture is simplified. The magnetic field dissolves the fine 
structure coupling. / and s are, to a first approximation, uncoupled, and precess inde- 
pendently around ^o- The quantum number for the total angular momentum, j\ thus 
loses its meaning. This limiting case is called the Paschen-Back effect. 

The components of the orbital and spin moments {jui)^ and in the field direc- 
tion are now individually quantised. The corresponding magnetic energy is 





(13.20) 


and the splitting of the spectral lines is 




AE = {Ami+2Am^)ju^Bo. 


(13.21) 



For optical transitions, there are again selection rules, and as before, Am[=0 or 
± 1 for 7 T or cr transitions. Since electric dipole radiation cannot, to a first approxima- 
tion, effect a spin flip, it also holds that Arris = 0. With these rules, (13.21) yields a 
triplet of spectral lines like those of the ordinary Zeeman effect. 

Figure 13.15 shows the splitting scheme of the Na D lines. A vector model is shown 
in Fig. 13.16, which makes it clear that a total angular momentum vector j cannot even 
be defined here. Like the Zeeman effect, the Paschen-Back effect is chiefly used in 
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Fig. 13. 15 a -c. Paschen-Back effect (c) and Zeeman effect (b) 
with the D| and D2 lines of the neutral sodium atom (a). In the 
limiting case of strong magnetic fields, one observes one un- 
shifted and two symmetrically split lines, as in the ordinary 
Zeeman effect 




Fig. 13.16. Paschen-Back effect. In the limiting case of a strong 
magnetic field Bq, the spin S and orbital L angular momenta 
align independently with the field Bq. A total angular momen- 
tum J is not defined 



empirical term analysis. In many-electron atoms, where the single-electron quantum 
numbers y , /, and s are replaced by the many-electron quantum numbers /, L and 5, this 
method is especially important (Chap. 17). 

The area between the limiting cases of weak fields (Zeeman effect) and strong fields 
(Paschen-Back effect) is difficult to analyse, both theoretically and experimentally. 



13.5 Double Resonance and Optical Pumping 

One can make use of the difference in polarisation of the various Zeeman components 
in order to populate selectively individual Zeeman levels, even when the spectral resolu- 
tion is insufficient or the linewidth is too great to obtain the excited state otherwise. 
This is the simplest case of optical pumping. 

The first experiment of this type is represented in Fig. 13.17 (Brossel, Bitter and 
Kastler 1949- 1952). Mercury atoms in an external magnetic field Bq are excited by ir- 
radiation with linearly polarised light in a tt transition to the mj = 0 level of the 
excited state. The emission from these atoms is also linearly polarised n light. Now one 
can induce transitions Am = ±1 with a high-frequency coil perpendicular to Bq, as 
shown in Fig. 13.17, and thus populate the Zeeman substates m = 1 and m = -1, The 
light emitted from these levels, however, is circularly polarised a light. The emission 
of circularly polarised light in a direction perpendicular to that of the n emission can 
thus be used for the detection and measurement of Am= ±1 transitions between 
Zeeman substates. 
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Here, the same transitions as in electron spin resonance are observed, but they are 
detected optically. By means of this double resonance technique (double excitation with 
light and with high-frequency radiation), an extremely high detection sensitivity can be 
reached, because the high-frequency quanta with small quantum energies are detected 
via the much more energetic light quanta. In this way, the detection of spin resonance 
in a short-lived excited state becomes possible. Double resonance methods of this type 
have attained considerable importance in spectroscopy in the past 30 years. 

The principle of optical pumping may be explained conveniently using the example 
of the sodium D lines, e.g. the transition from the ^Sj /2 ground state to the ex- 
cited state. In an applied magnetic field, both terms are split into the Zeeman terms 
mj = ± 1/2 (Figs. 13.12 and 13.18). If the “pumping” light is now circularly polarised, 
for example as a'^ light, only transitions from mj= -1/2 in the ground state to the 
mj = + 1/2 excited state can take place, populating only the latter state. Emission from 
this state can occur either as light, leading to the mj = - 1/2 initial state, or as 
n light, leading to the ground state term with mj = -t- 1/2. Overall, this pumping cycle 
increases the population of the terms with mj = -l- 1/2 in the ground state at the cost of 
those with mj= -1/2. An equalisation of the populations can occur through relaxa- 
tion processes, for example by means of collisions of the Na atoms with one another or 



Bq=0 Bq*0 
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Bi - field coils 

4 — 
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Fig. 13.17. Double resonance, after the method of Brossel, Bitter 
and Kastler. In the upper part of the figure, the three Zeeman levels 
of the excited state are shown. The lower part shows the experi- 
mental arrangement. The mercury atoms are contained in a cuvette 
between two pairs of coils, which produce the constant field Pq ^nd 
the high-frequency field B^. The transition occurs between the 
ground state of the Hg atom 6 j^(^So) and the excited state 

6s6pep^) 



- High-frequency field coils 




Fig. 13.18. Optical pumping of the transition of 

the sodium atom. In the field Pq» the terms split up into Zeeman 
terms with mj= ±1/2. Only atoms in the ground state rrij = 
- 1/2 absorb the light with which the sample is irradiated, n 
transitions occurring in emission from the excited state lead to 
an increase in the population of atoms in the ground state with 
rrij = +1/2. With the high-frequency field, transitions from 
rrij = + 1/2 to my = - 1/2 are induced, increasing the number of 
atoms which are able to absorb the pumping light 
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with the walls of the container. If these processes are not sufficiently rapid, one can 
induce transitions in the ground state by irradiation with microwaves. These electron 
spin resonance transitions change the populations of the Zeeman terms. The detection 
of this ESR can be accomplished optically, namely through the change in the intensity 
of the absorption from ^Si/2, mj= -1/2 to ^Pi/2, rnj= +1/2, provided that the 
population of the ground state terms was altered by optical pumping. Figure 13.18 
shows the experimental arrangement schematically. This is thus also a double reso- 
nance method. 

Double resonance methods, in which magnetic resonance transitions are detected 
by means of the absorption or emission of light in the visible or ultraviolet spectral 
ranges, have also acquired considerable importance in molecular and solid-state phys- 
ics. They are termed ODMR, for optically-detected magnetic resonance. This subject 
is treated in more detail in Molecular Physics and Quantum Chemistry by H. Haken 
and H.C. Wolf, Sect. 19.7. 



Problems 

13.1 What frequency is required to induce electron spin transitions from the parallel 
to the antiparallel configuration, or vice versa, if the magnetic field is 10“^ tesla? 

13.2 Why is the term not split in a magnetic field? Explain this in terms of the 
vector model. 

13.3 Calculate the angle between the total and the orbital angular momenta in a 
state. 

13.4 The spectral lines corresponding to the 3p ^ 3s transition in sodium have the 
wavelengths A2 = 5895.9 A and = 5889,6 A. 

a) Calculate the magnetic field strength at which the lowest Zeeman level of the ^Py 2 
term would coincide with the highest level of the ^P \/2 term, if the conditions for the 
anomalous Zeeman effect were still fulfilled. 

b) How large are the frequency differences between the outer two components of the 
Dj line and of the D2 line in a magnetic field of 1 tesla? 

13.5 Discuss the splitting of the lines in the ^ 2p transition in the presence of a 
magnetic field when the Zeeman splitting is small compared to the spin-orbit interaction. 

13.6 Sketch the Zeeman splitting in the lines of the hydrogen atom Balmer series. 

Calculate the magnetic moments of the states P1/2, P3/2, Dy 2 ^nd D5/2. 

Also sketch the splitting in the Paschen-Back effect. At what magnetic field does the 
transition from the Zeeman to the Paschen-Back effect occur? 

13.7 a) Consider hydrogen atoms in a magnetic field 5 q = 4.5 tesla. At this field 
strength, is the splitting of the line (n = 3 ^n = 2) due to the anomalous Zeeman 
effect or the Paschen-Back effect? Support your answer. (The spin-orbit coupling 
between the 3^P\/2 and 3 terms of the hydrogen atom is 0.108 cm 
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b) Sketch the splitting of the energy levels in the given magnetic field and show the 
transitions which contribute to the line. Into how many components is the line 
split? 

c) Determine the specific charge e/m of the electron, given that the frequency 
splitting between two neighbouring components is 6.29 • 10^^ Hz. The fine structure 
can be ignored here. 

d) Is the wavelength splitting in the first line of the Lyman series {n = 2 = 1) 

smaller, larger or the same as that of the line? 

13.8 a) Sketch the energy levels of a free electron in a magnetic field as a function 
of the field strength. 

b) Consider two electrons whose spins are coupled (the precise coupling mechanism 
is not important for the problem; e.g. they could interact as two magnetic dipoles). 
How many possible orientations are there? Distinguish between singlet and triplet 
states. 

c) What is the minimum number of electrons which you must couple together in 
order to produce a sextet state? 

Hint: An x-tet state is named for its multiplicity. 




14. Atoms in a Magnetic Field: 
Quantum Mechanical Treatment 



14.1 Quantum Theory of the Ordinary Zeeman Effect 

The ordinary Zeeman effect is a beautiful example of the fact that even with classical 
physics, one can obtain results similar to those of strict quantum theory. In order to set 
our earlier results on a firm basis, however, we shall now go through the strict quantum 
theoretical treatment. 

This chapter is somewhat more demanding, because we shall have to make use of 
some of the basic theory of electromagnetism. As is shown in this theory, a magnetic 
field B can be expressed as the curl of the vector potential A : 

B = c\xr\A. (14.1) 

The electric field strength F ^ can be obtained in a similar way from the electric poten- 
tial V and the vector potential A according to the rule 

rJA 

F= -gradF- — . (14.2) 

dt 

Furthermore, we remember that the equation of motion of a particle with charge - e 
(we are thinking specifically of electrons here) and mass is 

mor = {-eW) + {-e){vxB), (14.3) 

The second term on the right is the so-called Lorentz force, v is the particle velocity. It 
can be shown that this equation of motion can be obtained, using the Hamilton 



equations 

p = -gxdidrHip.r) and (14.4) 

/•■= gradp/fO?,/*) , (14.5) 

from the Hamiltonian function 

H = -^{p + eAf+V. (14.6) 

2mo 

The potential energy V of the electron is related to the electric potential V:V= -eV. 



1 



In order to avoid confusion between the energy E and the electric field strength, we denote the latter by F 




224 



14. Atoms in a Magnetic Field: Quantum Mechanical Treatment 



At this point, it is only important to remember that in quantum theory, we always 
start from a Hamiltonian function. As we saw in Sect. 9.3.4, the Hamiltonian function 
is converted to an operator in quantum mechanics by using the Jordan rule to replace 
the momentum, according to 



P 




(14.7) 



By applying this technique here, we arrive at the Hamiltonian operator 

Jf = [(l/2/«o)[^/i)grad + e^]^+ V . (14.8) 



When we multiply out the squared term, taking care to maintain the order of the 
factors, we obtain 

fP' , he he e^A^ 

V^+ A grad + _^grad/4 + + V. (14.9) 

IrriQ IniQi IitiqI 

In applying the various differential operators, however, we must be careful, since we 
know that ^ is to operate on the wavefunction \\/, Thus we must interpret 



grad^ (14.10) 

exactly as 

grad(>lj//). (14.11) 

On differentiating the product in (14.11) and then again applying (14.7), we obtain for 
the Hamiltonian 

.^= --^V^+— ^/7 + -^divy4 + -i-:^+ F. (14.12) 

2/?7o 2/7Zoi 2 /Mo 



(The operators ^raJient, r//rergence, and curl used here are vector differential 
operators which are often abbreviated using the Nabla symbol V, with V/ = grad /, 
V-fsdivF, VxF^curlF, and V- V/= vV= Laplacian /, where / is a scalar 
function and F a vector function.) 

We now choose, as always in this book, the constant magnetic field B in the z 
direction: 



5 = (0,0,BJ. (14.13) 

It can be demonstrated that the vector potential^ in (14.1) cannot be uniquely deter- 
mined. One possible representation, which is convenient for the present calculation, is 








A, = 0. 



(14.14) 
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With this, the Schrodinger equation with the Hamiltonian (14.12) becomes 






2mn 



e h d 8 

X -y 

2mo i \ dy 8x 



In the following, we shall assume a spherically 
We recall the following relation from Sect. 



+ ^ (x^+y^) + V(r) 

8 mo 



y/ = Ey/ . 

(14.15) 



symmetrical potential for V. 
10 . 2 : 



— lx—- —] = l=— — 

i \ dx J ^ i 00 



(14.16) 



where ^ is the angular momentum operator in the z direction. In general, the term in 
(14.15) containing {x^+y^) can be neglected in comparison to the preceding term with 
4, if the magnetic field is not too large, and as long as the magnetic quantum number 
m 4=0. Leaving out the term with and using the usual formula for the wave- 

function. 



M,(r) = Rn,Ar)e"'^vf(cosd), 



(14.17) 



we recognise that (14.15) is identically satisfied. The energy is now 

E = + (14.18) 

2 mo 

The energy E is thus shifted with respect to the unperturbed energy E„ by an amount 
which depends on the magnetic quantum number m, and the energy level is split. The 
factor ju^ = eh/{2mo) is the Bohr magneton which was introduced earlier. With the 
addition of the selection rules for optical transitions. 



Am = 0 or ± 1 , 



the above derivation leads to the splitting of spectral lines known as the ordinary 
Zeeman effect (Sect. 13.3). 



14.2 Quantum Theoretical Treatment of the Electron and Proton Spins 

14.2.1 Spin as Angular Momentum 

As we saw in Sect. 12.4, the electron has three degrees of freedom in its translational 
motion, and a fourth in its spin. As we know, a number of other elementary particles, 
including the proton, have spins too. Our quantum mechanical calculations to this 
point, especially our derivation of the Schrodinger equation and its application to the 
hydrogen atom, have not included spin. In the following, we shall show how spin is 
included in the quantum theoretical treatment of atomic states. This is necessary, for 
example in spin-orbit coupling, in the anomalous Zeeman effect, in spin resonance. 
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and in an adequate formulation of the Pauli principle, which will be discussed later. 
Like every angular momentum, the spin of the electron is a vector with three spatial 
components Sx, Sy and s^: 

s = (Sx,Sy,s^) . (14.19) 

In the following development of the spin formalism, we must account for the ex- 
perimental observation that the spin has only two possible orientations such that the 
spin component in a chosen direction, e.g. the z direction, can only have the value 
+ h/2 or - h/2. In this sense, it is a genuine two-level system. 



14.2.2 Spin Operators, Spin Matrices and Spin Wavefunctions 

Since it is intuitive to think of one of the states of spin as “spin up” and the other as 
“spin down”, we shall first introduce in a purely formal way two “wave” functions 
which correspond to these spin directions, i.e. and 0i. If we proceed strictly accord- 
ing to quantum formalism, measurement of the z component of the spin corresponds to 
applying the operator to a wavefunction. (As with the angular momentum I, we 
distinguish the spin operator from the corresponding classical parameter by using the 
“hat” sign.) We can choose the wavefunctions in such a way that the application of the 
operator gives the observed values of the wavefunction. Because we have only two 
observed values, namely h/2 and - h/2, we expect that 




and 



(14.20 a) 




(14.20b) 



These can be summarised as 



where Wj = +1/2 (corresponding to t) or 
Ws = - 1/2 (corresponding to i). 



(14.21) 



nis is thus the quantum number of the z component of the spin. 

We are now looking for a formalism which will more or less automatically give us 
the relations (14.20a, b). It has been found that this is most easily done by using 
matrices. A matrbc, in mathematics, is a square array, for example 



M = 



a b 
c d 



(14.22) 



There is a multiplication rule for this array. As an example, let us imagine a vector v 

with the components x and y in a plane, or v = ( ^ ) . We can produce a new vector 
/x\ 

x',y' hy multiplying 1 by M. This is done according to the rule 

\yJ 
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X' 

y\ 



= M[ 



a b 
c d 



X 

y ^ 



ax+by 
cx + dy 



(14.23) 

We thus are looking for a “vector” 0 and a matrix M such that M0 yields exactly either 

A^or - 
2 2 

We shall simply give the result, and then verify it. We choose in the form 



2 \0 -1 / 

and the spin functions in the form 



(14.24) 






0 



0i = 



1 



(14.25) 



With the help of (14.23), it can be immediately calculated that substitution of (14.24 
and 25) in (14.20a and b) actually yields the relations M0^ = (h/2)0], M0i = 
-(h/2)0i. We obtain the most general spin function by superposition of and 0i 
with the coefficients a and b, as we have done before with wave packets: 



0 = a0^ + b0i = 



(14.26) 



In order to arrive at a normalisation condition, we must now introduce the “scalar 
product” for the 0 ’s. If we have a general 0 i in the form 



01 = 



*1 



and another 02 in the form 



(14.27) 



02 - 



ai 



we define the scalar product as 



(14.28) 



01 02 = (albt) j = (at G2+bt b2 ) . (14.29) 

These are calculation rules, which should be familiar to the reader from vector calcula- 
tions. If we substitute in (14.29) 0 i = 0 |, 02 = 0|, we obtain 



0T0t= 1 



(14.30) 
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and correspondingly, 



0i(t)[=\. (14.31) 

Thus the wavefunctions are normalised. With and 02 = 0t> we have 



= 0 



(14.32) 



i.e. the wavefunctions are mutually orthogonal. 

With (14.24), we have the first part of the solution of the entire problem. The repre- 
sentation of the operators for the x and y directions of the angular momentum is 
naturally still open. Because we are talking about angular momenta, it seems reason- 
able to require the usual commutation relations for angular momenta (10.14). We do 
not wish to go into the mathematics of the problem here. For the purposes of this book, 
it is sufficient simply to choose and Sy appropriately. It turns out that 



and 



0 1 
1 0 

0 -i 

1 0 



(14.33 a) 



(14.33b) 



are suitable. If we calculates^ = §l+Sy + sl with the matrices (14.24, 33a and 33b), we 
obtain after a short calculation 



4 V 0 3 / 4 V 0 1 






— • (unit matrix) . 
4 



Therefore, it we apply 5 ^ to any spin function 0, in particular to 0^^, it will always yield 

The analogy between this equation and the eigenvalue equation for the orbital angular 
momentum with the eigenvalue ^^/(/+ 1) (10.6) is especially clear if we write 3/4 
in the form h^s(s+ 1), with s = 1/2: 






(14.34) 



14.2.3 The Schrodinger Equation of a Spin in a Magnetic Field 

We shall now proceed to the formulation of a Schrodinger equation for the spin in a 
magnetic field. A magnetic moment 

eh 
IrriQ 



- 



(14.35) 




14.2 Quantum Theoretical Treatment of the Electron and Proton Spins 



229 



is associated with the electron spin of h/2. Here Wq is the rest mass of the electron and e 
is the positive unit charge. This magnetic moment, the “Bohr magneton”, was 
presented in Sect. 12.2. Since the magnetic moment is a vector oriented antiparallel to 
the electron spin, we can write more generally 

(14.36) 

mo 

where the factor h/2 is now naturally included in the angular momentum s. The 
following calculations can be directly applied to the spin of a proton, if the Bohr 
magneton is consistently replaced by the so-called nuclear magneton -ju^ and 
- e/mo by e/m^, is defined as - (mo/m^) and m^ is the mass of the proton. The 
negative sign comes from the fact that the charge of the proton is the negative of the 
electron charge. 

The energy of a spin in a spatially homogeneous magnetic field i? is, as is shown in 
electrodynamics, 

Vs=-juB. (14.37) 

We are trying to find an equation analogous to the Schrodinger equation, and we 
realise from the previous discussion of quantum mechanics that the Schrodinger 
equation was obtained from energy expressions (Sect. 9.2). There the energy expres- 
sions were the Hamilton functions, which were then converted to the Hamiltonian 
operator. In a similar way, we now make the energy expression (14.37) into an operator 
and write the equation 

—B-S<p = E0. (14.38) 

mo 

If the magnetic field has the components By and B^, the left side of (14.38) is 

{Bj(Sx-\- BySy-\- B^s^ (j) . (14.39) 

mo 



Now §x, §y and are the matrices (14.33 a, b and 24), respectively. Therefore (14.39) is 
also a matrix. According to the rules for the addition of matrices, it is 

eh / B^ B^-iBy 
2mQ\Bx+iBy -B^ 

The characteristic of being an operator thus accrues to the left side of (14.38) from 
§, which was defined above as the spin operator. If we choose the field B in the z 
direction, as above, 

S = (0,0,B,), (14.41) 

the left side of (14.38) is the same, except for the numerical factor eB^/nio, as the left 
side of (14.20a or b) which shows us that the functions introduced above (14.25) are 
also eigenfunctions of the operator in (14.38) with the corresponding eigenvalues 



(14.40) 
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E=±n^B,. (14.42) 

The spin energy in a constant magnetic field in the z direction is thus just given by the 
expression which we would expect in classical theory for the interaction of an anti- 
parallel spin moment with a magnetic field. Of course, instead of (14.38), we could 
have formulated the corresponding time-dependent SchrOdinger equation 

J-B-S(t> = \h^. (14.43) 

mo dt 

This equation must be used, in particular, if we are dealing with a time-dependent 
magnetic field. 



14.2.4 Description of Spin Precession by Expectation Values 

It is, however, also interesting to determine the time-dependent solution of (14.43) for a 
constant magnetic field. If we choose a magnetic field in the z direction, the Schrddin- 
ger equation is given by 

H^bJ\ ^\d> = \h^. (14.44) 

\ 0 - 1 / dt 

The general solution is found as a superposition of (14.26). Since the 

Schrddinger equation contains a derivative with respect to time on the right-hand side, 
we have to include in and 0\, the corresponding time functions 

exp(-i£'^t/li) and tx]?{-iE],t/h) , 

where E‘\ and E[ may be written in the form 

E^ = (fi/2) a>o, Ei= - (h/2) coo , and (Oq = —B^. (14.45) 

mo 



Since a linear combination may also contain constant coefficients, we use the more 
general trial solution for (14.44): 

0(0 = aexp(-icuo^/2)0t + Z?exp(icuo^/2)0i . (14.46) 

We require 0 to be normalised, as always in quantum mechanics, i.e. that the scalar 
product 00 (14.29) be equal to one. This means 

\a\^+\b\^=\. (14.47) 

The physical meaning of (14.46) will become clear when we form the expectation 
value of the spin operator s with this wavefunction. To do this, we must first recall how 
expectation values are to be calculated, and refer to Sect. 9.3. The “recipe” given there 
states: 
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1) Take the wavefunction y/, 

2) allow the “operator for the measurable quantity” Q of which the expectation value 
is to be found, to operate on it, 

3) then multiply with y/* and integrate: 

\y/*(x)Qy/{x)dx . 

The steps 1-3) can easily be transformed into three analogous rules for calculating 
with the spin formalism: 

1) Take the spinfunction <j), e.g. (14.46), 

2) let the spin operator s^, Sy, or operate on (14.46), i.e. form, for example 5^0; 

3) multiplication by 0 and integration are replaced by the rules for calculating the 
scalar product: 

we multiply 0 from the left by 0. 

As an abbreviation we set 



aexp(-icuof/2) = a, 

6exp(imot/2) = P . 

The individual steps 1-3) are now as follows: 



P. 



1) 0= a(p^ + p<pi = I 

2) 5^0 = — ( , , 

2 Vo -iJ\P, 

Using the rule (14.23), this is equal to 

!(-;) 

- fi / a 



3) 0S, 0 = 0 

2 \-p^ 

According to rule (14.29), the right-hand side is equal to 

y(|«p-|j6p). 

Writing the expectation value of as <5^>, we have found: 

<^z> = y(|ap-|AP)- 

We leave it to the reader as an exercise to show that 



<Sjc> = —ia*P+ aP *) , 



(14.48) 



(14.49) 

(14.50) 



(14.51) 

(14.52) 



(14.53) 



(14.54) 
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and 

{§y) = —\{a^*-a*p). (14.55) 

2 

Since we can see all the essentials by assuming a and b in (14.48) to be real numbers, 
we shall do so and insert (14.48) into (14.53 - 55). This yields 

= ^(a^-b^) = const w.r.t. time . (14.56) 

The expectation value of the z component of the spin thus remains constant in time. 

= abh cos ojQt , (14.57) 

{§y) = abh sin coot . (14.58) 

The component of the spin in the x-y plane rotates with the angular velocity coq. The 
expectation values (14.56-58) can be interpreted as a precessional motion of the spin 
(Fig. 14.1). Thus the model used in Chap. 13 is justified by quantum theory. 




14.3 Quantum Mechanical Treatment of the Anomalous Zeeman Effect 
with Spin-Orbit Coupling* 

In this section we shall continue the complete quantum mechanical treatment of spin- 
orbit coupling. The goal is to give an exact justification for the vector model of spin- 
orbit coupling introduced in Chap. 12. Specifically, we are concerned with LS coupling 
and wish to show the justification for the rule that /^, and p can be replaced by 
1(1+ i), 5 ( 5 -f-1) and y'C/'+l), respectively. If we ignore spin-orbit coupling for a 
moment, then the energies of the orbital motion and of the spin (magnetic moment) in 
a magnetic field are additive. This means that the total Hamiltonian is simply the sum 
of the Hamiltonians for orbital motion (14.8) and for the spin (14.38). Thus we have 
the Schrddinger equation 

— ^ — (—gxdid + ey^ +V+-^s • B 

2mo \ i / mo 



\f/= ih 



d\(/ 

~aT 



(14.59) 
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This is also known in the literature as the Pauli equation. 

Because the Hamiltonians (14.8) and (14.38) are additive and apply to entirely dif- 
ferent degrees of freedom, the wavefunction y/ can be written as a product of the wave- 
function of the orbital motion and that of the spin motion. Finally, we can also treat 
the spin-orbit coupling introduced in Sect. 12.8 quantum mechanically. For this we 
need only to introduce the previously derived expression (12.27), which represents an 
interaction energy, into quantum mechanics. This is done, as usual, by assigning the 
angular momentum / to the angular momentum operator I {SqqX. 9.3.4.) and the spin 5 
to the spin operator s [compare with (9.61)]. The resulting term 



Wits) = 



HoZe^ 

8nmo 




Anr^ 



( /^orbit ' 



/^spin) 



(14.60) 



is introduced into the SchrOdinger equation to give the Schrddinger equation of an 
electron with spin in a magnetic field, where spin-orbit coupling is taken into account. 
The time-independent form of this equation is 



^2 . j j- . 

V -f A grad -I- dwA + 

2 mo mpi 2wjoi 



e^A‘ 



2 mo 



+ V 



+ —S-B + 
mo 



MoZe^ 

8nmo 




If/ = Ey/ . 



(14.61) 



As we saw in Sect. 13.3, spin-orbit coupling dominates in low magnetic fields. 
Therefore we shall first examine the SchrOdinger equation in the absence of a magnetic 
field: 



2mo Aneor 8nmo r 



y/ir) = E y/(r) . 



(14.62) 



Equation (14.62) includes the spin operator S which, as we know, is a matrix. Therefore 
the wavefunction y/(r) has two components: 



y/{r) = 



Vdr) \ ^ 

Wiir) I ’ 



where y/i corresponds to spin t and y/i to spin i. 

Spin-orbit coupling mixes orbital and spin states, and makes it necessary to 
introduce new quantum numbers. Without spin-orbit coupling, the wavefunction 
would have the form 



Wn,l,m,m, ~ 0 ) 0m, • 

' V ' 

orbit spin 



(14.63)^ 



The letter 0 in (14.63) has two entirely different meanings: in F/ 0), it indicates an angular coordinate, 
while in 0^^ it indicates one of the spin wavefunctions (14.25) 



2 
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It is characterised by the principal quantum number n, the orbital angular momentum 
quantum number /, the magnetic quantum number = and the spin quantum 
number In order to determine the quantum numbers applicable to spin-orbit 
coupling, we must expand on the considerations on orbital angular momentum 
presented in Sect. 10.2, and examine the parameters to decide which can be observed 
simultaneously. As we know, this can be done with the help of commutation relations 
(Sect. 9.3). If, as in Sect. 12.7, we introduce the total spin operator / = l + s, and its 
component in the z direction, /^, the following parameters can be observed to any 
desired precision simultaneously: 



The square of the orbital angular momentum / ^ 

The square of the spin 

The square of the total angular momentum 

Component 

I • s and j - s . 

Because / • s occurs in (14.62), we can characterise the wavefunction by choosing those 
quantum numbers which are eigenvalues for the operators p, P, f ^ and /^. We there- 
fore obtain the following relations between operators and quantum numbers 

j^: quantum number y p quantum number my 

(14.64) 

s^: quantum number 5 P: quantum number / . 



Since the spin-orbit coupling is much smaller than the term spacing, the prin- 
cipal quantum number n is still a good quantum number, i.e. it still characterises 
the eigenfunction to a good approximation. The wavefunction is now character- 
ised by 

Wnj^mpi.s = • (Function of angle and spin) . (14.65) 



The spin-orbit coupling leads to the relative orientations of the spin and orbital 
moments, as was discussed in detail in Sect. 12.8. 

We now examine the effect of a magnetic field on an electron, taking spin- 
orbit coupling into account. It can be shown that in the Schrodinger equation (14.59), 
the term is much smaller than the other terms, if the magnetic field is not too 
large, and can be ignored. Let us again choose the magnetic field B in the z direc- 
tion and 



Ax = - Y^y > ^nd = 0 . 



div^ is then zero. The Schrodinger equation is then 
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mo 



Snmor 



\l/ = Eii/. (14.66) 






Wr 



magn 



w. 



spin-orbit 



We are treating the case of a weak magnetic field in which the spin-orbit coupling is 
larger than the interaction with the external magnetic field. We are now in a position to 
justify quantum mechanically the vector model introduced in Chap. 13. Let us consider 
the operator occurring in (14.66): 



JO ^ T3 

=^(4+25.) = — (4 + ^.) (14.67) 

IrriQ IrtiQ 

(it leads to an additional magnetic energy, which we called Sects. 13.3.4, 5). If we 
here had 4+^,; instead of 4+2^^, the solution would be very simple, and analogous to 
the treatment of an electron without a spin in the magnetic field (Sect. 14.1). In that 
case, the wavefunction \j/, which is already characterised by the quantum number ruj, 
would also be an eigenfunction of the operator 4 = 4+'^z- We must therefore see how 
we can deal with the additional in (14.67). Let us consider 

V ^ , (14.68) 



which can be rewritten as 

x~ J x~^ i^zJy~Jz^y)Jy • (14.69) 

V / 

Q 

It can be shown that the matrix elements of the operator q disappear when it is 
applied to wavefunctions with the same quantum number y, or, in other words, the 
operator q can only couple wavefunctions with different values of j. If the externally 
applied field is small, we can also expect that such transitions will make only a small 
contribution and can therefore be ignored. In the following, we shall therefore leave 
out the operator q. With this approximation, (14.68) can then be written as 

= (14-70) 

where we have replaced 5 • / by the corresponding expression on the right side of 
(14.70). It is important to note that all the parameters in (14.70) are operators. We now 
apply both sides of (14.70) to a wavefunction |//, which is characterised by the quantum 
numbers y, mj, /, and 5. We obtain 

■ fi^J(J+0v 

n t T T 

Operators Operator Numbers 

= fi^-Jz- iUU+ 1) - /(/+ 1) +5(5+ 1)] V' • (14.71) 

T ' ^ ' 

Operator Numbers 
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If we divide the right half of the double equation (14.71) by h^j(j + we obtain 



2y(/ + l) 

If we write ITmagn (14.67) in the form 



W - ( f + 5 ) 

^^magn ^ » 



5=1/2. 



(14.72) 



2ff?o I I 

Operators 

we finally obtain 



(14.73) 



^^agn W ■ 



eB 



2 m, 



' Jz * 

0 t L 



1 + 



y(/ + !)-/(/+!) + 5(5+1) 



27(/+1) 






(14.74) 



Operator 



Numbers 



The additional energy due to the orientation of the total moment j in the magnetic field 
is represented by (14.74). 

If we write the energy change of a quantum state n, j, I, mj in the form 



ch 

= - Bg-mj, 

' 2mo 

we can infer the Lande factor by comparison with (14.74) to be 
y(/ + l)-/(/+l) + 5(5+l) 



(14.75) 



^ = 1 + 



2y(/ + l) 



(14.76) 



We derived this Lande factor earlier, in an intuitive way with the help of the vector 
model, but we had to make use of the law of cosines in an ad hoc fashion when we 
replaced/ ^ by y (/ + 1) by /(/+ 1) and s'^ by 5 ( 5 + 1) The quantum mechanical 

calculation presented here gives the exact basis for this substitution. 



14.4 Quantum Theory of a Spin in Mutually Perpendicular 
Magnetic Fields, One Constant and One Time Dependent 

A number of important experiments on spin have been carried out with the following 
arrangement: both a constant, spatially homogeneous magnetic field in the z direction 
and an oscillating field in the x-y plane are applied. We shall see that this leads to the 
interesting phenomenon of spin flipping. These experiments make possible, among 
other things, the exact measurement of magnetic moments, and permit detailed 
analysis of the structure of and relaxation processes in liquids and solids. 

We shall see that we can easily solve these problems using the spin formalism 
introduced in Sect. 14.2. We write the magnetic field expressed as a time-dependent and 
a time-independent part: 
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B = (14.77) 

where the vectors of the magnetic fields are defined as 

Bq = (0,0,B^) and (14.78) 

B\t) = {B^At),B;{t),0). (14.79) 

Naturally we cannot expect that the spin will always point up or down in a time- 
dependent magnetic field. Rather, we must expect time-dependent transitions. We take 
these into account by writing the wave function which is to be a solution of the 
Schrodinger equation (14.43) in the general form 



0(0 = q(O0t + C2(O0i= j . (14.80) 

To arrive at equations for the still unknown coefficients Cj and C 2 , we substitute (14.80) 
in (14.43), observing the decomposition (14.77-79). If we multiply (14.39) out like a 
normal scalar product and observe the matrix form of s^, Sy, and - see (14.40) - we 
obtain the Schrodinger equation (14.43) in the form 



I 






Bi-is; 



Cl 

^ 2 , 



ih 



Cl 
Cl . 



(14.81) 



If we multiply the matrices according to the rule (14.23), we obtain these equations 
instead of (14.81): 



(,\ho3a)ci->r fi^{Bl-'iBl)C2= '\hCi , (14.82) 

H^(Bl+ iBy)Cx~\h ojqCi = \hC2. (14.83) 



Here we have introduced the frequency 

^«o = 2//b5“ (14.84) 

as an abbreviation. In order to simplify the following calculation, let us think of the 
transverse magnetic field as rotating with the frequency cu. In other words, the 
magnetic field has the form 

Bl = Fcoscot , 

(14.85) 

By = Fsmcot . 

Since Bl and By appear in (14.82, 83) in a combined form, let us first consider these 
expressions. We can express them as an exponential function, due to elementary rela- 
tionships between sines and cosines: 

jB^±i5^ = F(coso;/±isincuO = FQxp(±icot ) . (14.86) 

Then (14.82, 83) simplify to 
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(^o>o/2)ci + ;«B^exp(-i(yOc2 = . (14.87) 

fi^FQxp(\o)t)Ci-(ho}Q/2)c2 = \hci. (14.88) 

We shall solve these two equations in two steps. In the first, we put the coefficients 
Cj{t) into the form 

C\it) = cfi(0 exp(-i£Uot/2) ; C 2 (t) = diit) exp(icuot/2) . (14.89) 

If we differentiate (14.89) with respect to time and rearrange slightly, we obtain 

i^Ci = (^coo/2)ci + i^rfiexp(-icuot/2) . (14.90) 

If we substitute this in (14.87), we see that the term {hcoQ/Tfc^ on both sides cancels 
out. The same thing happens with C 2 in (14.88), so that (14.87) and (14.88) simplify to 

(OQ)t]d2= '\hdi , (14.91) 

yUBFexp[i(a)-cuo)t]c(i = i^^2- (14.92) 

These equations become very simple when we set the rotational frequency of the 
magnetic field cu equal to the spin frequency coq: 

CO = coq . (14.93) 

We then obtain 



H^Fd2=ihdi, (14.94) 

H^Fdi = ihd2. (14.95) 

To solve these equations, we first take the time derivative of (14.94): 

^l^Fd2 = ihd^, (14.96) 

and then, according to (14.95), we replace d 2 by (ji^Fd{)/{\h), and thus obtain 






(14.97) 



If we simplify the expression by setting n^iF/h = Q, we recognise (14.97) as a typical 
oscillator equation with the general solution 

di = as\n{Qt+d>) , (14.98) 

where the amplitude a and phase ^ are free to vary. Using (14.98) and (14.94) we obtain 

c /2 = iacos(f3t+ 0) . (14.99) 

With the proper choice of the zero time, we can set ^ = 0. The normalisation condi- 
tions for the spin wavefunction require that a=\. If we substitute (14.99) in (14.89) 
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and this in (14.80), and do the same with (14.98), we obtain the desired spin wavefunc- 
tion 

0(0 = sin(i30 exp( - i o>ot/2) 0 | + i cos(Qt) exp(i cyo02) 0 ^ . (14.100) 

The spin functions and the spin formalism naturally seem very unintuitive. In order 
to see the meaning of the above equations, let us remember that the immediate predic- 
tions of quantum mechanics can be read from the corresponding expectation values 
(Sect. 9.3). We will first develop the expectation value of the spin operator in the z 
direction. A comparison of (14.49) with (14.100) shows that we can now express the a 
and p of (14.49) in the form 

a = sin(f2r) exp( - i u>ot/2 ) , 

(14.101) 

P = i cos(Qt) exp(i (Oot/2) . 



These can be immediately substituted into the end results (14.53 - 55), however, to give 



<5j> = (h/2) sin^(I?t) - cos^(Qt) 
= -(h/2) cos(2Qt) . 



(14.102) 



According to (14.102), the z component of the spin oscillates with the frequency 2 Q. If 
the spin is originally down at t = 0, it flips up, then down again, and so on. 

For the other components. 



<^x> = 



- — sin(2i30 sin(cuoO , 
2 



(14.103) 



= 



— sin(2i2r) cos(o>oO 
2 



(14.104) 



These equations indicate that the spin motion in the x-y plane is a superposition of two 
motions, a rapid rotational motion with the frequency coq and a modulation with the 
frequency 2 The entire result (14.102-104) can be very easily interpreted if we think 
of the expectation value of the spin as a vector s with the components (Sx), iSy}, and 
<5^>. Obviously the projection of the vector on the z axis is ( - h/2) cos (2 Qt), while the 
projection in the x-y plane is (h/2) sin(2i2t). As can be seen from the formulae, the 
spin gradually tips out of the - z direction toward the horizontal, and then further into 
the + z direction, while simultaneously precessing. The spin thus behaves exactly like a 
top under the influence of external forces, as we indicated in previous chapters. 

We shall consider this process again, in more detail. At a time t = 0, 



(s,)=-h/2. (14.105) 

We now ask when the spin, considered intuitively, is in the horizontal position, i.e. 
when 



<^z> = 0 . 



(14.106) 
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This is clearly the case when the cosine function vanishes, that is when 

2Qt= n/2 (14.107) 

holds, or when the time 

t = n/(^Q) = nh/(^ii^F) (14.108) 

has passed. If one allows the transverse magnetic field to act upon the spins for this 
time, they will be pointing in the horizontal position (Fig. 14.2). In other words, they 
have been rotated by an angle n/2. We therefore speak of a n/2 or of a 90° pulse. 
Naturally, we may allow the magnetic field to act for a longer time, for example until 
the spins are pointing up, i.e. 

{s^) = h/2, (14.109) 

This occurs when 

cos(2I20= -1 (14.110) 

is fulfilled, i.e. after the time 



t = nh/(2ju^F ) . 

In this case, we speak of a tt or of a 180° pulse (Fig. 14.2). 



(14.111) 





Fig. 14.2. Left: Spin flip through n/2. Right: 
Spin flip through n 



With these considerations, we have sketched the most important traits of spin 
resonance. By applying a rotating magnetic field, we can cause the spin to flip from one 
direction to another. In practice, of course, one does not apply a magnetic field 
rotating with the spin frequency, but a linearly oscillating magnetic field. This can be 
pictured as a superposition of two fields rotating in opposite directions. Then one of 
the fields rotates with the spin, as before, while the other rotates with twice the fre- 
quency, as seen from the point of view of the rotating spin system. 

The corresponding equations have practically the same form as those above, except 
for an additional, rapidly oscillating term, which comes from the “oppositely rotating” 
magnetic field. To a good approximation, this can be ignored; the result is the “rotat- 
ing wave approximation” . 
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14.5 The Bloch Equations 

As we have just seen, the behaviour of the expectation values of the spin operators can 
be very simply interpreted. It is thus reasonable to ask whether it would not be possible 
to derive equations for these expectation values themselves. This is in fact the case. To 
derive these equations, we use the explicit form which we have just derived for the 
expectation values of the spin operator. We differentiate with respect to time and 
make use of (14.103) to obtain 

d 

— {Sx) = -(^/2)2Dcos(2^20sin(cuoO-(^/2)cuosin(2^20cos(cooO • (14.112) 

dt 

The first term on the right side contains - {h/2) cos(2Qt), which, however, is none 
other than the expectation value of the z component of the spin. We also recognise that 
the second term on the right side contains the expectation value of the y component of 
the spin. Equation (14.112) therefore has the form 

— (S;,) = /i~^2jUBFsm(cooO<s^>-coo<Sj,> . (14.113) 

dt 

However, we have seen the factors in front of the expectation values on the right side 
before. F sin(cuoO is just By, while coq is proportional to If we also take into 
account the relationships (14.84) and (14.85), (14.112) becomes 

— = — (14.114) 

dt rriQ rriQ 

In a similar way, we find that the time derivative of the y component of the spin is 

= ~—<S,)B,+ —BM . (14.115) 

dt rriQ ftiQ 

If we differentiate the expression (14.102) for <5^>, we immediately obtain 

—{s,y = —2Qsm(2Qt). (14.116) 

dt 2 

Since we expect that the right side of (14.116) can be expressed in terms of the expecta- 
tion values of the spin components, like (14.114) and (14.115), we take advantage of 
the relation 

sin^ ojot y cos^ ojot = 1 (14.117) 

to write the right side of (14.116) in the form 



hQsm(2Qt)[sin(coot) sin(cooO + cos(cuoO cos(cuo01 • 



(14.118) 
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It is now easy to convince oneself that (14.116) can also be written in the form 

(14.119) 

at niQ mo 

Equations (14.114, 115 and 119) can be written in the form 

— a>=/ixB, (14.120) 

dt 



as can be easily seen from the rules of vector multiplication. Here we have assembled 
the expectation values of the three components of the spin operator into the vector 



<«> = 



iSy) 

L<^vJ 



(14.121) 



This is strongly reminiscent of the torque equation for a top, if we identify s as the 
angular momentum and take into account that 



mo 



(14.122) 



Equation (14.120) is not quite adequate for the interpretation of many experiments, 
because in many cases, the spin of the particle interacts with its environment. For 
example, the orbital motion of the spins is continually perturbed by lattice oscillations. 
This results in continual phase shifts in the precession of the spin. In this case, it is no 
longer sufficient to regard the equations of a single spin as representative for those of 
all the spins, as we have implicitly done up to this point. Instead we must consider an 
“ensemble” of spins, and accordingly, we must in a certain sense subject the pure 
quantum mechanical expectation values we have used so far to another averaging 
process. We have to take into account the fact, for instance, that the x component of 
the spin no longer has a definite value at a definite time, but rather a distribution of 
values. As time passes, the distribution of values widens out, so that the probability 
that the value of (s^) is positive approaches the same value as the probability that it is 
negative. This means, however, that in the course of time, the average value of 5-;^ goes 
to zero. In order to take this decay into account, we add more terms to (14.120) which 
reflect this incoherent spin motion. 

These qualitative considerations are reflected by the phenomenological rule 

^<4>incoh= (14.123) 

at 12 

Since and Sy play the same role, we must naturally assume the corresponding rule for 
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Since the spins precess around the z axis, (14.123) and (14.124) indicate how quickly 
the components transverse to decay. Ti is therefore often called the transverse relaxa- 
tion time. It is a measure of the speed with which the individual precessional move- 
ments of the spins get out of phase. 

Since the z component of the spin is directed along a predetermined, constant field, 
it must be treated differently from the other two. In this case, too, we would expect a 
relaxation due to the interaction of the spin with its environment. It will naturally 
depend upon the orientation of the spin with respect to the external magnetic field - 
whether the field lies in the positive or negative z direction. The spin can give up energy 
through its coupling to the environment, and will attempt to reach the lowest state if 
the environment is at the absolute zero of temperature, T = 0. On the other hand, if the 
environment is at a finite temperature, the system of the spins and their environment 
will attempt to come to thermal equilibrium. At thermal equilibrium, some of the spins 
will be in the higher state, and others in the lower. If the spin system is displaced from 
thermal equilibrium, it will naturally attempt to return to it, and in a certain time 
interval which we call Tj. T\ is often referred to as the longitudinal relaxation time. 
What we have just said can be put in mathematical form, if we take 



(‘S'^)incoh 

at 



Sq- ih) 

71 



(14.125) 



for the incoherent relaxation of <5^>. Here Sq is the value of <5^> which the spin com- 
ponent would assume at thermal equilibrium. We arrive at the Bloch equations by 
adding the “incoherent” terms (14.123-125) to the equation (14.120) describing the 
“coherent” motion of the spin. 

The Bloch equations thus have the form 



— <f> = - ~^{s) xB + 
dt mo 





Sq- <Sz> 

71 



(14.126) 



The relaxation times 71 and 7^ are a measure of the strength of the coupling of the 
electron (or proton) spin to its environment. Measurement of T\ and 7^ often provides 
important information about processes in the environment of the spin being inves- 
tigated, e.g. motion in liquids and solids. We shall discuss a typical and especially 
elegant experiment in Sect. 15.4. 



14.6 The Relativistic Theory of the Electron. The Dirac Equation 



In order to correctly describe the interaction of an electron with a magnetic field, we in- 
troduced spin operators, which represent the intrinsic degree of freedom of the elec- 
tron. It was shown by Dirac that this intrinsic degree of freedom follows quite auto- 
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matically from relativistic quantum theory. We therefore wish to treat the Dirac equa- 
tion in this section. In order to arrive at a relativistic wave equation, it would seem 
appropriate to attempt to derive it in the same manner as the non-relativistic Schrodin- 
ger equation (see Sect. 9.2). 

The derivation given there can be summarised in the following “recipe” (cf. also 
Sect. 9.3): one starts with the classical relation between energy and momentum for a 
force-free particle 

E=p^/2mo (14.127) 



and replaces the energy E and the components of the momentum p by operators 
according to 






and 



h 6 

Px~*— — — 
1 0X 



h 0 

Py-^— — ’ 
1 0y 



Pz- 



A A. 

i 0z 



The last equivalence can be abbreviated as 



(14.128) 

(14.129) 



P 




(14.130) 



Following the computational rules of quantum mechanics (cf. Sect. 9.2, 3), these 
operators act on wavefunctions whereby (14.127) thus becomes the well-known 
Schrbdinger equation 

fc2 

\h-—'P=- (14.131) 

0 ^ Irrio 

We shall now attempt to apply this recipe to the relativistic relation between energy and 
momentum. The latter is 

E = ]/p^c^+mlc* . (14.132) 

If we replace E and p by operators according to (14.128) and (14.130) and allow the 
resulting expressions on both sides of (14.132) to act upon a wavefunction % we obtain 
the equation 

o 

ih — T=y *P. (14.133) 

dt 

This equation contains the Laplace operator under a square-root sign, which may 
at first appear to be only a cosmetic defect. However, this approach failed utterly when 
the attempt was made to include the effects of electric and magnetic fields on the elec- 
tron in such a wave equation. The theory had entered a cul-de-sac. Physicists chose two 
routes to lead it out again: 
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Route 1: The Ktein-Gordon Equation 

Since all of the difficulties stem from the square root in (14.133), one has to consider 
ways to avoid it. To this end, we square both sides of (14.132) and obtain 

£’^ = pV+m^c^ (14.134) 

which, of course, may be immediately translated into the wave equation 

(14.135) 

dr 

called the Klein-Gordon equation. 

The latter may be arranged in a more elegant (and relativistically more obvious) 
form by dividing both sides by and introducing the operator 

□ ^=V'--L-^. (14.136) 

The Klein-Gordon equation is then given as 
2 2 

□ 2y= moc y (14.137) 

h^ 

Let us examine its solutions. Since, for a force-free particle, we expect the solutions to 
be de Broglie waves, we use the trial function 

1#^= exp[i(A: -r-cuO] (14.138) 

in which, as usual, 

co = — and p = hk . (14.139) 

h 

If we insert (14.138) in (14.135), we obtain (14.134) as the immediate result. 

In order to determine the energy E itself, we must naturally take the square root. 
We thus obtain not only a positive energy 

£ = + Yp^c^+mlc* (14.140) 

but also a negative energy 

£ = -l/pV+m^c". (14.141) 

Since free particles can have only positive energies, we are faced here with a difficulty! 
Furthermore, the analysis of the solutions reveals that the particle density can also 
become negative, which is also an unphysical result. The Klein-Gordon equation was 
reinterpreted by Pauli and Weisskopf, who used the charge density instead of the mass 
density, and thus found it to be applicable in quantum field theory to particles with 
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spin zero. However, further development of that topic lies outside the framework of 
this book. 



Route 2: The Dirac Equation 

Dirac considered the question as to whether the root in (14.132) could not be extracted 
in some simple manner. In the limit /? = 0, we find 

]/p^c^+mlc^ moc ^ ; 

and, for = 0, (14.142) 



-^pc . 

In order to understand Dirac’s approach, let us first consider the one dimensional case 
and generalise (14.142) to 



\/ plc^+mlc^ = acpx+ Pmoc^ • (14.143) 

This relation can clearly not be fulfilled in the general case mo^O by ordinary 
numbers a and J3; however, it can be, when a and are matrices, as we shall proceed to 
demonstrate. We square both sides of (14.143), remembering that matrices do not com- 
mute, in general, so that we must maintain the order of a and p in multiplying out the 
right-hand side of (14.143). We then obtain 

plc^-^mlc^= a^c^pl-\-{aP-\- Pa)moC^Px-\- P^mlc^ . (14.144) 



For the left and right sides of this equation to be equal, we clearly require that 

a^=l; ap+^a=0; (14.145) 



These relations are familiar from the (Pauli) spin matrices! (compare Problem 14.2). 
Unfortunately, we cannot use the latter directly, since we wish to describe a three- 
dimensional, not a one-dimensional motion. Thus, we require 

^ cc\cpx+ c(2cpy+ a3cpz~\- Pmoc^ . (14.146) 



Squaring (14.146) leads, analogously to the one-dimensional case, to 



ay = l; P^=i; ajP+Paj = 0\ and 
ajak+aicaj=0 for y= 1,2,3 and A:= 1,2,3. 



(14.147) 



In addition, as always in quantum mechanics, the operators (matrices) are Hermitian. 
These relations may be fulfilled in various (but physically equivalent) ways, for 
example 



aj 0 






1 0 
0 -1 ’ 



(14.148) 
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where the oj are the Pauli spin matrices (cf. 14.24, 33 without h). The “l”s in p repre- 
sent 2x2 identity matrices, so that ^ may be written in the conventional notation as 



)8 = 



10 0 0 

0 10 0 

0 0-10 
0 0 0 -1 



(14.149) 



After these intermediate steps, we can again attack the Dirac equation, employing the 
translation rules (14.128) and (14.130) and applying them to the equation 

E= aicpx+a2cpy+aicp^+ Prtioc^ . (14.150) 

This leads to 



6 2 
ih — !P= {aicpx+ U2cpy+ aicp^+ Pmoc ) !P, 
dt 



(14.151) 



the Dirac equation. 

Since a, and )S are 4 x 4 matrices, they must operate on vectors with four com- 
ponents, i.e. !Pmust be of the form 









9^2 



(14.152) 



In the preceding sections dealing with the electronic spin, we became acquainted with 
wavefunctions having 2 components; in the Dirac theory, they have four! This is a 
result of the fact that the Dirac equation allows both positive and negative-energy solu- 
tions for free particles. 

As the reader may verify in one of the problems to this chapter, the Dirac equation 
yields the same energy spectrum as the Klein-Gordon equation; it is given in (14.140, 
141), and in Fig. 14.3. One can easily convince oneself that the solutions of the Dirac 
equation for force-free particles are plane waves having the form 



nr,t) 



^2 



exp(ifc -r-icot ) , 



(14.153) 



where the constants *Pi, . . . IKt are computed in Problem 14.6. 

In the Dirac equation in the form (14.151), the time derivative plays a special role 
relative to the spatial-coordinate derivatives. However, in relativity theory, time and 
space coordinates have a symmetric position as components of space-time four-vectors; 
thus, in the literature, a symmetrised form of the Dirac equation is often used. It is 
obtained by multiplying (14.151) on both sides from the left by and introducing 

new matrices 



E 





i 


















— 


— 0 























Fig. 14.3. Energy spectrum of 
the Klein-Gordon equation 
and the Dirac equation 
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y^ = Paj, with y'= 1,2,3. (14.154) 

It may be shown that the resulting equation is “Lorenz covariant”: 

ih ( ¥'= moC^, (14.155) 

\ 0JCi 0X2 0x3/ 

with = ct, Xi = x, X2 = y, X3 = z. 

The explicit forms for the matrices and / are 



0 I 1 0 I 

io -li’ 



yJ = 



0 cr^ 
- 0 



(14.156) 



where the are again the Pauli matrices. 

Finally, we discuss the inclusion of the action of electric and magnetic fields on the 
electron in the Dirac equation. For this purpose, we use the procedure of the Schrodin- 
ger theory once again: 

1) The potential energy V{r)= -eV, which results from the electrostatic potential 
F, is added in analogy to (9.32). This can also be expressed by adopting the following 
expression: 

>ih — + eV. (14.157) 

dt dt 



2) The magnetic field is taken into account by replacing the momentum operator 
(as in Sect. 14.1) by 

+ (14.158) 

i i 

where A is the vector potential. 

The resulting Dirac equation has been solved for several cases, in particular for the 
hydrogen atom. The results are in very good agreement with experiment, apart from 
the corrections due to quantum electrodynamics (Lamb shift). 

In spite of the success of the Dirac theory, the question of the meaning of the 
negative energy values for free particles remained open. They would permit an electron 
with a positive energy to emit light and drop down to deeper-lying, i.e. negative energy 
levels, and thus all particles with positive energies would finally fall into this energy 
chasm. 

Dirac had the ingenious idea of assuming that all the states of negative energies were 
already occupied with electrons, following the Pauli principle, according to which each 
state can contain at most two electrons with antiparallel spins. The infinitely large 
negative charge of this so-called “Dirac sea” can be thought to be compensated by the 
positive charges of the protons, which likewise obey the Dirac equation and must fill a 
corresponding positively charged Dirac sea. The vacuum would, in this interpretation, 
consist of the two filled Dirac seas. 

If we now add sufficient energy that an electron from the Dirac sea can cross over 
the energy gap of 2 mo c^, an electron with positive energy would appear, leaving behind 
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a hole in the Dirac sea. Since this hole is a missing negative charge, but the Dirac sea(s) 
were previously electrically neutral, the hole acts like a positive charge ( + e). Further- 
more, it has the same properties as a particle, so that it appears as such. The creation of 
electron-hole pairs can indeed be observed; the positively charged particles are 
experimentally known as positrons. 

In modern quantum field theory, the creation of positrons can be described directly 
by means of a formal trick, without having to invoke the infinite filled Dirac sea. On 
the other hand, precisely this idea of a Dirac sea provides an intuitive picture for the 
appearance of positively charged electrons, i.e. the positrons. 



Problems 



14.1 The Landau levels 

If an otherwise free electron is moving in a magnetic field, it is forced into a circular 
path in the plane perpendicular to the magnetic field. It thus has a periodic motion and 
would be subject to quantisation, even in the Sommerfeld formulation. This quantisa- 
tion leads to discrete levels, the Landau levels. These also result from an exact quantum 
mechanical calculation. 

Problem: Solve the time-independent Schrodinger equation of a particle with charge 
(-e) which is moving in the x-y plane perpendicular to a constant magnetic field B. Do 
not take the electron spin into account. 

Hint: Use the vector potential^ in the form^ = (0,^;^,0) and the trial solution 
V/(x,7,z) = e'*-^0(x) . 

In addition, make use of the fact that (p{x) satisfies the Schrodinger equation for a dis- 
placed harmonic oscillator. 



14.2 Show that for the spin operators s^ and Sy (14.33 a, b), the following relations 
hold: 



SxSy-\- SySx — 0 , 





2 



= 1 . 



Hint: Use the explicit matrix form. 



14.3 Demonstrate that the relativistic expression for the energy, (14.132), may be 
written in the form 

2 mo 

provided that 



2 mo 2 



Hint: Expand the square root in a series. 
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14.4 Show that the (charge) conservation law in the form 

+ div y = 0 
dt 

may be derived from the Klein-Gordon equation. 

Hint: Multiply the Klein-Gordon equation (written with by T* and subtract from 
the result its complex conjugate. Use: 



Q = Y T , 

2woC^ y / 

j = — ^ (!P* grad T- IPgrad !P*) . 
lirriQ 



14.5 Show that each component of (14.153) satisfies the Klein-Gordon equation. 
Hint: Write the Dirac equation (14.151) in the form 



ih— = je^ ( 1 ) 

Qt 



then take \h{Q/Qt) on both sides; use (1) again and rearrange using the Dirac 
matrices. 



14.6 Solve the Dirac equation for a force-free particle moving in the z direction. 
Hint: Substitute the trial solution 






^0 

'Pi 

Pi 

P 4 J 



Qxp(ikz-icot) 



into (14.151) and solve the resulting algebraic equation. 
Which energies correspond to the various solutions? 




15. Atoms in an Electric Field 



15.1 Observations of the Stark Effect 

In 1913, Stark observed a splitting of the lines of the Balmer series of hydrogen (8.2) in 
an electric field. He was studying the light emission of H atoms in the field of a 
condenser (Figs. 15.1, 2). Since then, frequency shifts in optical spectra in the presence 
of eletric fields have been generally called the Stark effect. 

The effect is experimentally more difficult to observe than the Zeeman effect, 
because it is necessary to generate strong electric fields without sparking over. It has 
been far less important to experimental atomic physics than the Zeeman effect. 

One observes: 

- With hydrogen and similar atoms, such as He"^, Li'^'^, etc., a splitting of the 
terms with / + 0 and the spectral lines associated with them. The splitting is propor- 
tional to the field strength F. This so-called linear Stark effect is present when the 
/ degeneracy - the degeneracy of states with the same principal quantum number 
n and different orbital angular momentum quantum numbers / - is lifted by the 
external electric field, when it is not already lifted by internal atomic fields; 

- Displacement and splitting of terms in all atoms proportional to F^. This is the 
quadratic Stark effect. 

The quadratic Stark effect can be understood qualitatively in an intuitive model. 
The lifting of the / degeneracy by the other electrons in an atom always leads to states 
which have no electric dipole moment averaged over time. This can be demonstrated 




Fig. 15.1. Canal ray tube for investigation of the emission of atoms in an elec- 
tric field: the Stark effect. The potential between the cathode C and the elec- 
trode H can be as high as 8000 V. The resulting splitting of spectral lines is ob- 
served through the window W. In order to observe neutral H atoms, the 
positive ions of the canal rays must be neutralised. This is not shown in the 
figure 




Fig. 15.2. Splitting of the hydrogen atom lines in an 
electric field. The strength of the field varies along the 
light source, the image of which is shown after passage 
through a spectrograph slit. The field is 10^ V/cm in the 
region of smaller splitting near the bottom of the figure 
and rises to a value of 1.14 • 10^ V/cm in the region of 
the greatest splitting. From K. H. Hellwege, Einfuhrung 
in die Physik der A tome, Heidelberger Taschenbticher, 
Vol. 2, 4th ed. (Springer, Berlin, Heidelberg, New York 
1974) Fig. 45 
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exactly by quantum mechanics. The applied electric field induces an electric dipole mo- 
ment = a JF in the atom, where a is the atomic polarisability. The latter is naturally 
a function of the quantum numbers of the atomic state and is different for each elec- 
tron configuration. 

The electric field acts on this induced dipole moment. The interaction energy is 
given by 

\p ■ F = \aF^ . (15.1) 

We have thus explained qualitatively the proportionality between the term shifts 
and the square of the electric field strength. 

The linear Stark effect, which is observed in the hydrogen atom, cannot be so easily 
understood on an intuitive basis. This is an effect which can be understood only in 
terms of quantum mechanics. The degeneracy of the /-states in the absence of a field 
F is lifted by the electric field, because the Coulomb potential of the nucleus is per- 
turbed by the field F. This effect will be treated in greater detail in Sect. 15.2. 

The linear Stark effect, which is observed in the hydrogen atom, cannot be so easily 
understood on an intuitive basis. This is an effect which can be understood only in 
terms of quantum mechanics. The originally present degeneracy of the /-states is lifted 
by the electric field. This effect will be treated in greater detail in Sect. 15.2. 

The fundamental difference between the Stark effect and the splitting of spectral 
lines in a magnetic field is the fact that in an electric field, states with the same absolute 
value of the magnetic quantum number ntj, i.e. ntj and -ntj, behave in an identical 
manner. This can be easily understood: the effect of an electric field on a “clockwise” 
and on a “counterclockwise” rotating electron, when the spatial distribution of the 
electrons is otherwise the same, is, averaged over time, the same. The number of split 
components is therefore smaller in the Stark effect than in the Zeeman effect: the 
number of different terms is not 2y-l-l, but rather y +l for integral j and j+1/2 for 
half-integral j. 

An example is the Stark effect of the Na D lines, shown in Fig. 15.3. The magnitude 
of the Stark shift is about 0.05 A for the Na D lines in fields of about 10^ V/m (10^ 
V/cm). It increases with the principal quantum number n, since orbits with a larger 




mj 

±3/2 

± 1/2 

+ 1/2 



± 1/2 



Fig. 15.3. Energy level diagram showing the Stark 
effect on the sodium doublet 3^P3/2^/2- 3^5i/2 and 
the splitting pattern of the D lines 



V 
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principal quantum number also have a larger polarisability. Therefore the Stark effect 
is extremely important for the investigation of Rydberg atoms (cf. Sect. 8.12). 

The Stark effect which is caused by the strong electric fields resulting from the 
chemical bonding between atoms, is extremely important to the understanding of 
molecular spectra. It is also important for the clarification of the influence of the 
crystal electric field in solids on the atomic term diagrams of component atoms, as well 
as in gases at high densities. In the latter, the Stark effect is the most important source 
of spectral line broadening. 



15.2 Quantum Theory of the Linear and Quadratic Stark Effects 

15.2.1 The Hamiltonian 

We shall treat the quantum theoretical explanation of the Stark effect in some detail 
here, since we shall, in the process, be able to introduce the general and important 
methods of perturbation theory. 

We wish to investigate how the wavefunctions and the energy levels of an electron 
are changed, when, in addition to the attractive nuclear potential V{r), a constant elec- 
tric field is acting on it. We write the Hamiltonian of the complete problem in the form 

= (15.2) 

where 

fc2 

.^ 0 = V^+K(r) (15.3) 

Into 

is the original Hamiltonian without the applied field. In (15.2), as in the following, the 
upper index “P” indicates “perturbation”. 

If the electric field has the field strength F, the electron is acted on by the force 

-eF, (15.4) 

(In order to avoid confusion between the energy E and the electric field strength, we 
denote the latter here by F.) The corresponding potential energy, which results from 
“force times distance”, is then 



V^ = eFr, 



(15.5) 



provided F is homogeneous. 

Since the formalism which we are about to develop may be applied to perturbations 
which are more general than (15.5), we have written instead of in (15.2). It is 
found in many cases that the applied electric field produces only a small change in the 
electron wavefunctions and energies, i.e. it acts as a so-called small perturbation. In 
order to express the smallness of this perturbation explicitly, we write in the form 

= A (15.6) 



where A is a small parameter. In the following, we also assume that the time-inde- 
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pendent Schrodinger equation without the external perturbation potential has already 
been solved: 

Jfo0v = ^v0v (15.7) 

The indices 0 on jfo ^^d indicate that these quantities refer to the unperturbed 
problem. We shall at first assume that the energies are all different from one 
another. 



15.2.2 The Quadratic Stark Effect. Perturbation Theory Without Degeneracy** 

To be able to solve the Schrodinger equation which also contains the perturbation 
potential, namely 

je\i/ = Ey/, (15.8) 

we represent the solution for which we are searching, y/, as the superposition of the un- 
perturbed solutions (py,. We expect, indeed, that the electric field will shift and perhaps 
also change the shapes of the wavefunctions. These modified wavefunctions may be 
constructed from the unperturbed ones by adding them to wavefunctions belonging to 
other energy levels (Fig. 15.4). On the basis of such considerations, which also may be 
justified mathematically in a strict sense, we arrive at the following trial solution for the 
wavefunctions we are seeking: 

00 

W(r)= Z Cy,(t>y,{r) . (15.9) 

V=1 





Fig. 15.4. Example of how the super- 
position of two wavefunctions 0^ and 
02 can lead to a new wavefunc- 
tion with a shifted centre of gravity 
(centre of charge) {right) 



Here it is important to note that the wavefunctions 0^ depend upon the position 
coordinate r, but that the coefficients c„ do not. To fix the latter, we insert (15.9) into 
(15.8) and obtain immediately 

^0 I c,0,(r)-l- ** E c,0,(r) =EY. c^(t>^{r) . (15.10) 



^ In this book, we treat time independent perturbation theory. Time dependent perturbation theory, which 
is also very important, is described in detail in Vol. 2, H. Haken and H.C. Wolf: Molecular Physics and 
Elements of Quantum Chemistry. 




15.2 Quantum Theory of the Linear and Quadratic Stark Effects 



255 



In the first term on the left side, we shall use the fact that the wavefunctions (py obey 
(15.7) and we may thus replace Jto0v by Ey<py. In order to free ourselves from the r 
dependence in (15.10), we multiply from the left by 0* and integrate over all space. As 
we shall show in the appendbc, the wavefunctions are orthonormal, i.e. the following 
relations hold; 

\(l>*</>ydV=d^y. (15.11) 

Furthermore, we introduce the abbreviations 



(15.12) 

Since the parameters H^y carry the two subscripts fi and v, one often thinks of the H^y 
arranged in a square array. Such an array is called a “matrix” in mathematics, and the 
Hj^y are thus also called “matrix elements”, or more exactly, “matrix elements of the 
perturbation operator With the help of (15.11) and (15.12), we obtain the follow- 
ing equation from (15.10) 



(E^,-E)c^+lH^yCy = 0, (15.13) 

V 

which one must imagine as written out for all indices ju. Thus far, our procedure is com- 
pletely generally valid and makes no use of the small magnitude of the perturbation. 
We now assume, however, that the perturbation is small, in that we imagine, according 
to (15.6), that the parameter A increases from zero. If the perturbation is exactly equal 
to zero, the solution sought, (15.9), must naturally be the same as one of the starting 
solutions <j)y. The coefficients which result for A = 0 are indicated by the superscript 0. 
The initial state is denoted by a subscript k. We thus obtain the relation 



- 1 for V = K 
'0 for V + 
or in shorter form, 

c®= ^ 

^ V ^VK • 



(15.14) 

(15.15) 



If we now let A increase, the coefficients Cy will also change, of course. We shall 
expect that as a first approximation, the coefficients increase proportionally to A. As 
the next approximation, we must then take into account changes proportional to A^, 
and so on. The same will naturally also hold for the new energy values E. We thus 
arrive at 

Cy=dy,+ Xci^^ + X^cfH... (15.16) 

and 

£' = £’®-^A£<‘>-t-A^e^^>-t-... . (15.17) 



We subsitute these expressions in (15.13) and thereby obtain 

(El-El-Xe^^^-Xh ^^'^- . . .)(«5^,,+ Ac<‘4 . . .)+ I XHly{dy,+ Ac<‘4 . . .) = 0 . 

^ (15.18) 

Expressions (15.16) and (15.17) define orders of magnitude, as one can easily 
convince oneself by setting, e.g. A = 0.1. In this case, A^ = 0.01, which is only 10% of 
A. Speaking a bit loosely, what we are doing is to solve (15.18) for the different decimal 
places. In strict mathematical terms, this means that we must multiply out all the terms 
of (15.18) and arrange them according to powers of A. We then have to require that the 
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coefficients of the individual powers of A cancel out independently. For the zeroth 
power, we obtain 

= (15.19) 

which is identically fulfilled. For the 1st power of A, 

+ 0 • (15.20) 

For further discussion of this equation, we differentiate between the cases where 
yU = AT and n^K. For fi = k, (15.20) reduces to 

= = (15.21) 

For the perturbed energy, this means according to (15.17) that the 1st order perturba- 
tion approximation, 

E = eI+hI^^, (15.22) 



applies. If we choose yU + /c, the coefficients of the 1st order perturbation approxima- 
tion can be calculated from (15.20): 



M)- 
^ 1 1 



H\ 






^ K -^11 



fi + K 



(15.23) 



So far, the coefficient has not been determined. As can be shown from the normal- 
isation coefficient, it must be set equal to zero: 

4^’ = 0. (15.24) 

If we substitute the coefficients we have calculated into (15.9), the perturbed wavefunc- 
tion is, in the 1st order perturbation approximation. 



¥(r) = + I 



H 






E^-Ei 






(15.25) 



Now we can take into account the terms in second order, i.e. with A^. A short cal- 
culation yields 



.(2)^ \HU^ 



V*K ■ 



(15.26) 



With this, the energy in the second order perturbation approximation can be expressed 
as 



E = E'^+H^,+ 



v+« eI-e^ 



(15.27) 



We shall now examine the meaning of the formulae (15.25) and (15.27) in the case 
when an external electric field F is applied. It can be shown (see Sect. 16.1.3 on selec- 
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tion rules), for example that for the hydrogen atom = The matrix elements 
which are not equal to zero are, according to (15.5) and (15.12), proportional to the 
field strength F. Thus the energy E is shifted from the unperturbed energy by an 
amount, according to (15.27), which is proportional to F^. One therefore speaks of the 
quadratic Stark effect. 



15.2.3 The Linear Stark Effect. Perturbation Theory in the Presence of Degeneracy* 

In addition to this quadratic Stark effect, observations have shown a linear Stark 
effect. We approach it as follows. In a purely formal way, we can see from (15.23 - 27) 
that the method we used above will not work if the denominator, i.e. cancels 

and at same time the matrix element in the numerator is not equal to zero. This can 
actually happen, however, if we are considering degenerate states, such as we have seen 
in the hydrogen atom. There we have an entire set of different wavefunctions with the 
same principal quantum number n, but different / and m, which belong to a given 
energy. To treat this case in the presence of a perturbation, one must fall back on the 
so-called perturbation theory in the presence of degeneracy. 

Let us briefly review what was done in the first step of perturbation theory in the 
absence of degeneracy. We set up the requirement (15.14). In the case of degeneracy, as 
we know, it is not only the mutually degenerate wavefunctions which can be solutions 
to the Schrodinger equation for the energy E®, but also any linear combination of these 
wavefunctions. Thus if we think of the perturbation being turned off, the perturbed 
solution can be transformed into a linear combination of unperturbed solutions whose 
coefficients are not known. The basic idea of perturbation theory in the presence of 
degeneracy is to find these coefficients, in the zero order approximation, by a 
systematic procedure. To this end we write 

!//(/•)= I c®0v(^) + corrections , (15.28) 

V only over 
degenerate 
states 

where the summation includes only the mutually degenerate states. We ignore the cor- 
rection terms. Equation (15.28) is formally the same as our earlier (15.9), but now we 
are not summing over all states. However, the coefficients can be formally deter- 
mined if we return to the equation system of the form of (15.13), and allow the set of 
coefficients to replace the coefficients there. If we have N mutually degenerate 
states, we now have N equations with unknown coefficients before us. In order that 
the homogeneous system of equations be soluble, the determinant of the coefficients 
must vanish. This provides the condition 



{El-E+Hl,) HI2 ... 

{El-EyHlf) ... H% 

Hi,, ... {El-E+Hlff) 



(15.29) 



The determinant here is also called a secular determinant. If calculated, it yields an A^th 
degree polynomial in the energy E. When this is set equal to zero, it becomes an 
algebraic equation for E which has A^ roots, some of which may be equal to each other. 
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As a concrete example, let us treat the first excited state of the hydrogen atom with 
the principal quantum number n = 2. The wavefunctions of hydrogen are, as we know, 
indicated by the quantum numbers n, I and m. To relate these to the present system of 
indices, we write 



n - 2 , 

V 

where we use the table 



1 


for 


/ = 0 , 


m = 0 


2 


for 


/=!, 


m = 0 


3 


for 


/=!, 


m = \ 


4 


for 


/=!, 


m = 



The trial solution (15.28) thus becomes 



(15.30) 



(15.31) 



y/(r) = + cf^(t)2ir) + cf^Mr) + cf^<l)^{r) , 



(15.32) 



where, to repeat once more, the 0’s are wavefunctions of the hydrogen atom in the 
n = 2 state, which are all degenerate. The matrix elements (15.12) are then, in concrete 
terms, 

(15.33) 

^ V 

where it is assumed that the field is applied in the z direction. Using selection rules, it 
can be shown, as in Sect. 16.1, that all the matrix elements disappear except for 

Hl2 = Hl,. (15.34) 

This can be written in the form 

Hl2 = Hli = eFd (15.35) 

because the wavefunctions referred to are real. In the present case, where TV = 4 and all 
matrix elements except for (15.35) disappear, (15.13) reduces to 



(E2~E)cx + eFdc2 = 0 , 


(15.36) 


eFdci + {El-E)c2 = 0, 


(15.37) 


{E\-E)c2 = 0, 


(15.38) 


(EI-E)c^ = 0. 


(15.39) 



It is obvious that this system of equations breaks down into two groups of two: 
(15.36) and (15.37) in one group, and (15.38) and (15.39) in the other. The determinant 
for (15.36, 37) is 




15.2 Quantum Theory of the Linear and Quadratic Stark Effects 



259 



E\-E eFd ^ Q 
eFd E\-E ~ 

This goes to zero when E assumes the values 
£■+ =E\±eFd. 



(15.40) 



(15.41) 



It can be shown that the positive sign is associated with Cj = Ci, and the negative sign 
with Cl = -Cl- The energy E is increased or decreased with respect to the unperturbed 
energy, by an amount proportional to the field strength F. Equation (15.38) or (15.39) 
requires that the perturbed energy be the same as the unperturbed energy. In particular, 
it turns out that the wavef unctions 03 (r) and 04 (r) are in each case the “right linear 
combination”. This can also be seen from the fact that for 03 (r) and the per- 

turbation theory without degeneracy actually does not fail, because for the critical 
terms (zero energy difference in the denominator), the matrix elements of the 
numerator also go to zero. 

We thus obtain overall the scheme shown in Fig. 15.5 for the energy splitting. The 
same figure shows the wavefunctions in the presence of the field. 

The linear Stark effect discussed here is a special case, in that it is only observed in 
the hydrogen atom. The reason for this is easy to understand: the matrix element 
(15.33) differs from zero only when / /'. In contrast to hydrogen, the / degeneracy is 
lifted in other atoms, i.e. 



as was shown in Chap. 11. 








Fig. 15.5. The linear Stark effect. Lower left: As 
the field strength F increases, the energy level Ei is 
split into three levels. Upper left: Representation 
of four mutually degenerate wavefunctions in a 
single figure. Dotted line, 5 functions; solid lines, 
the p function “dumbbells” in the a:, y and z direc- 
tions. Upper right: Superposition of 5 function 
and p dumbbell in the z direction causes a shift in 
the centre of charge of the electron. Middle right: 
The dumbbells in the x and y directions are not af- 
fected. Lower right: Superposition of s function 
and p dumbbell (with amplitude in the opposite 
direction compared to the previous case) leads to a 
shift of the centre of charge in the negative z direc- 
tion 



Field strength 
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15.3 The Interaction of a Two-Level Atom 
with a Coherent Radiation Field 

In the previous chapters on the interaction of a spin with a changing magnetic field, we 
came upon the interesting phenomenon of spin flipping, which has found numerous 
applications in physics and chemistry. In this section, we shall show that a two-level 
atom interacts with a coherent radiation field in a manner which is exactly analogous to 
spin flipping. Although the spin is a system with exactly two levels, this assumption is 
only an approximation for an atom. We simply assume that a radiation field induces 
transitions between two neighbouring levels, and that all the other levels of the atom 
are energetically so far from the two under consideration that we can neglect the effects 
of the other states on the two at hand. This is understandable in light of the perturba- 
tion theory in the absence of degeneracy which was treated above, because combina- 
tions of the wavefunctions associated with distant levels with the wavefunctions 
belonging to the two close levels will have large energy differences in the denominator, 
and will thus make only small contributions (15.25). 

In the quantum mechanical treatment, we begin with a Schrddinger equation for an 
electron which is moving in the potential field V of the nucleus and in the additional 
potential of the radiation field Ka. This equation has the form 

f--^V^+F+Fa) V/(/-,0 = i^-^^^^^. (15.42) 

V 2mo j dt 

To find the explicit form of V^, let us think of the radiation field in the form of a plane 
wave: 

F = Focos(kx- cot) . (15.43) 

In order to avoid confusion between the energy and the electric field strength, we again 
indicate the latter with F. We assume, furthermore, that the atom is localised at /* = 0. 
Since the wavelength of the light A = 2 tt/A: is in general much larger than the extent of 
the electron wavefunctions of an atom, we can, to a very good approximation, set x = 0 
in (15.43). We then have a practically homogeneous radiation field over the atom, 

F = FoCOs(cot) . (15.44) 

We also assume that the radiation field is polarised in the z direction: 



Fo = (0,0,Fo). (15.45) 

The force exerted on an electron by an electric field F is given hy -e - F. The result- 
ing potential energy (negative of the force times the z vector) is 

V^=eFoZCOScot . (15.46) 

In the following, we shall use this in (15.42). We assume that we have already solved 
the Schrddinger equation in the absence of an external field: 
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= y = l,2, (15.47) 

i.e. we assume that the wavefunctions and energies are known, at least for the indices 

y = l,2. 

Since we expect transitions only between the two levels 1 and 2, we cast the wave- 
function of (15.42) in the form of a superposition of the unperturbed wavefunctions of 
(15.47), 

(//(r,0 = Ci(O0i(/') + C2(O02(O • (15.48) 

In order to determine the still unknown coefficients and C 2 , we substitute (15.48) into 
(15.42), multiply from the left by (pf or exactly as in Sect. 15.2, and integrate over 
the total space. By introducing the abbreviation 

Hfj = l0f(r)eFoZ0j{r)dV coscot , (15.49) 

we obtain the equations 

Cy = ^[{Ei + Hfi)C, + Hf2C2] (15.50) 

ih 

and 

C2 = -L [if p Cl + {E2 + h!2) C2], (15.51) 

ih 

in analogy with Sect. 15.2. 

In many cases, we can assume that //fi and H 22 vanish (compare Chap. 16 on sym- 
metries and selection rules). For the solution of (15.50, 51), we attempt the trial 
function 

= . (15.52) 

With this equation, (15.50) and (15.51) reduce to 

rfj = (15.53) 

ih 

and 

^2 = — . (15.54) 

ih 

Meanwhile we have assumed, according to (15.43), that the radiation field is mono- 
chromatic. We now introduce the further assumption that the field is in resonance with 
the electronic transition. This means that the following relation is valid: 



E2-E1 = hco . 



(15.55) 
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If we now extract the factor 

coscyt=|(e‘"'+e-'"') (15.56) 

from (15.49), and multiply by the exponential function in (15.53), we obtain the overall 
factor 

|(l + e"^'“")- (15.57) 

As we shall see in the following, rfj and vary quite slowly in time compared to the 
frequency cu, so long as the field strength is not too great. This makes it possible for us 
to average (15.53) and (15.54) over a time which is long compared to l/cu, but still 
short compared to the time constant which determines the change in dj, see (15.61). 

The result of this averaging is that the rapidly changing term exp(-2icu0 makes a 
contribution which is much less than 1 and can therefore be neglected compared to 1 . 
This is called the “rotating wave approximation” in the literature. The expression 
comes from spin resonance. (In Sect. 14.4 the term exp( - xcot) did not appear, because 
we had applied a rotating magnetic field in the first place.) In (15.54) there is a term 
with exp( + 2icu/) which corresponds to (15.57), and in this case too, it is negligibly 
small. If we abbreviate the integral in (15.49) by substituting the dipole moment matrix 
element (0J^ = |0f(r)ez0y(r)rfF, (15.53) and (15.54) reduce to 



^i = tVtW.)i2C?2 (15.58) 

ih 

and 

<^2 = T-|Fo(0,)2irfi. (15.59) 

ih 

These equations are strikingly similar to the spin equations (14.94) and (14.95) 
which we met in Sect. 14.4. It can be shown that (6z)n = (^^)?i can be chosen to be real. 
By introducing another abbreviation, 

Q = ^Fo(d,)n, (15.60) 

2h 

where Q is to be understood as a frequency, we obtain as a solution to (15.58, 59) 

rfi = cosQ/, (15.61) 

d2 = -isin^2^. (15.62) 

We have based this on the assumption that at time ^ = 0, the electron is known for 
certain to be in the lower level. Thus the Schrddinger equation (15.42) for a two-level 
system interacting with an external monochromatic radiation field is solved. The coef- 
ficients Cl and C 2 in (15.48) now obviously have the form 



Cj = e cos Qt , 


(15.63) 


C2 = 


(15.64) 
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As we know, the square of the absolute value of Cj gives the probability of finding 
the system in state j. |cyp can thus be unterstood as the occupation number Nj of the 
state j. As indicated by the corresponding formulae 



= |ci 1^ = cos^ Qt , (15.65) 

N2 = |c2p = sin^f3/, (15.66) 

the electron oscillates with the frequency Q between states 1 and 2. It is instructive to 
calculate the dipole matrix element according to 



y/dV = (e^)nC* C2+(6»^)2ic| q . 



(15.67) 



The final result of the whole process is 

dz = -(0Jl2sin(2^2O sincu^ . (15.68) 

This says that the dipole moment swings back and forth with the rapidly oscillating 
component sincu^, and that its magnitude is also modulated by sin2 Qt. The dipole 
moment is thus largest when the electron has exactly the occupation number 
N^ = N 2 = y, that is, its probability of occupying either level is the same. The result 
(15.68) and the formulae (15.65) and (15.66) are very closely analogous to the results 
obtained for spin resonance in Sect. 14.4. We shall examine this analogy more closely 
in the next chapter. It makes it possible to extend a series of spin experiments to optical 
transitions between electronic states in atoms. Such experiments require coherent light 
with a high field strength. The latter, Fq, is necessary so that the transitions can occur in 
a time to-l/Q-l/Fo - [compare (15.60)!] - which is so short that the electron 
motion is not appreciably perturbed by other effects, e.g. collisions between atoms in 
gases or the spontaneous emission of light from excited states. Typical values for lie 
between 10"^ and 10"^^ s. 



15.4 Spin and Photon Echoes 

In this section we shall treat two especially interesting phenomena, spin echo and pho- 
ton echo. If we compare the results of Sect. 14.4 with those of the preceding section, we 
see a very close analogy between the behaviour of a spin subjected to both a constant 
magnetic field and a transverse oscillating magnetic field, and an electron in a two-level 
atom which is subjected to an oscillating electric field. In both cases we assume that the 
frequency of the applied field is in resonance with the transition frequency of the spin 
or of the electron transition from the lower to the upper state. With reference to the 
analogy which we shall wish to use, we shall first discuss the behaviour of a spin. 

As we saw in Sect. 14.4, the application of a coherent, resonant field causes the spin 
eventually to flip. How far it flips depends on the length of time the external field is 
applied. If one lets the field work just long enough to flip the spin by n/2, one speaks 
of a n/2 or a 90° pulse. If the field is left on twice as long, the spin will flip completely 
over. In this case one speaks of a tt or 180° pulse. The so-called spin echo is an impor- 
tant application of these ideas. Here one first applies a n/2 pulse (Fig. 15.6). In a 
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<g(t) 




^ \ pulse 




n pulse 




~1 






^ ^ 



Fig. 15.6. The spin echo (photon echo) experiment. 
Above: the applied pulses of electric field strength 
(photon echo) or magnetic field (spin echo) as a 
function of time. Below: magnitude of the dipole 
moment (spin) of the atom as a function of time 
(schematically) 




Fig. 15.7. Spreading out and moving together of 
spins. Left: Starting condition, all spins parallel. 
Middle: Spreading out of spins. Right: Moving back 
together after 180° pulse (schematically) 



number of practical cases, the spins do not precess with the same velocity, due to 
spatially varying static magnetic fields. They therefore spread out in the course of time 
(see Fig. 15.7). Denoting the frequency width of the precession by zl cu*, we may define 
a mean time, within which the spins spread, by A of = 2n/T^, A of is called 
inhomogeneous width”. A single precessing spin can emit electromagnetic radiation. 
Because the spins get out of phase with each other, so do their electromagnetic emis- 
sions, which leads to a reduction in the total intensity. If one now applies another 
pulse, this time a 180° pulse (Fig. 15.6), the spins are flipped. What happens here can 
best be compared with runners on a track. At the beginning of the race, all the runners 
are at the same place, the starting line. After the starting gun (the 90° pulse), they have, 
however, moved different distances away from the starting line because of their differ- 
ent velocities. The effect of the 180° pulse is the same as that of a second gun, which 
signals the runners to turn around and return to the start at the same speed as before. 
Obviously, they all reach the start at the same time. For the spins, this means that at a 
certain time after the 180° pulse, they will again all be in phase, and thus their radiation 
will be in phase. As a result, the original radiation intensity is reached again. This 
picture must be somewhat modified, because there are also irreversible phase changes 
of the spins, which are characterised by the so-called homogeneous linewidth, which we 
shall discuss in Sect. 16.2. Due to this homogeneous linewidth, the original starting 
intensity can no longer quite be attained (see Fig. 15.8). If one repeats the 180° pulse, 
the result is that shown in Fig. 15.9. The decay time 72, which we met earlier in 
Sect. 14.5 in the Bloch equations, can be calculated from the decay of the peaks. The 
homogeneous linewidth can then be determined from T 2 . A(D = 2n/T2. The analogy 
between the spin and the two-level atom has now made it possible to apply the entire 
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Fig. 15.8. Decay of the spin emission and echo emission of protons in water. From A. Abragam: The 
Principles of Nuclear Magnetism (Oxford 1962) 




Fig. 15.9. A series of spin echos obtained from protons in ordinary water. In this method, a 90° pulse is 
applied at time / = 0, and is followed at times t, 3t, 5t,..., (2«-1)t by a 180° pulse. The echoes are 
observed at the times 2t, 4t, . . . , 2«t (« an integer). It can be shown that the height of the peaks decays 
according to the formula /(«) = exp(-2«r/r2). From A. Abragam: The Principles of Nuclear Magnetism 
(Oxford, 1962) 



process of spin echo to the radiation field in the case of the photon echo. We leave it to 
the reader to develop this analogy on the basis of the above. 

To this end, let us consider a system of two-level atoms. An example is the ruby, in 
which chromium ions are included as impurities in a basic lattice of aluminium oxide. 
To a certain approximation, these ions can be treated as two-level atoms. As we saw in 
Sect. 15.4, the occupation number of the upper state increases due to the applied 
resonant oscillating electric field (15.66). The electron of the impurity atom is thus to 
be found more and more frequently in the upper state, while the occupation of the 
lower state decreases correspondingly. Eventually a state is reached in which the 
occupation numbers are equal. Because this is an exact analogy to the state in which the 
spin has flipped through 90°, or in other words, the occupation numbers of the “spin 
up” and “spin down” states are equal, one speaks in this case, too, of a n/2 or 90° 
pulse. If one applies the external electric field to the electron for twice as long, it goes 
completely into the upper state, again in analogy to the spin, which is at this point 
rotated through 180° . Therefore one speaks again, in the two-level atom case, of a 180° 
or n pulse. As (15.66) and (15.60) show, the “flipping” occurs more rapidly when the 
applied field strength Fq is larger. 

We now imagine that the electron has been excited by a n/2 pulse. It can be shown 
that the dipole moment of the electron then oscillates freely with the frequency of the 
optical transition, o». This can be shown by a calculation of the expectation value of the 
dipole which is analogous to (15.67) and (15.68), although there is no external applied 
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field. According to Maxwell’s Theory, however, an oscillating dipole can emit electro- 
magnetic waves, in this case, light waves. This means that the ensemble of impurity 
atoms emits light after the n/2 pulse. However, since the atoms are subject to different 
fields within the crystal, their transition frequencies are not all the same, and some of 
the oscillating dipoles emit light which is out of phase. The emitted intensity is there- 
fore reduced. 

Now, just as in the spin case, we can apply a n pulse which brings the diverging 
phases of the oscillating dipoles back together. This prediction is justified because of 
the complete mathematical correspondence between the behaviour of spin and a two- 
level atom. As the oscillating dipole moments come back into phase, they emit a light 
pulse which can be seen as the “echo” of the previously applied n pulse. In the optical 
range, the phases of the dipoles diverge very rapidly, so it is necessary in these experi- 
ments to use short pulses of about 10 “^^s and less. 



15.5 A Glance at Quantum Electrodynamics* 

15.5.1 Field Quantization 

In this section we want to sketch the nonrelativistic theory of the Lamb shift. To this 
end we first show by means of an example how the light field can be quantized. We 
start with Maxwell’s equations in vacuum which reads: 



curl£:= - — , (15.69) 

dt 

cSF 

cur\B = eojUo—, (15.70) 

dt 

diwE = 0, (15.71) 

dWB = 0 (15.72) 

where 

Moeo = \ (15.73) 



and where c is the light velocity in vacuum. Let us consider a standing electric wave 
with wave vector k and with its electric vector in the z-direction 



E = (0, O^*^) , Ez = p{t)N sin {kx) 



(15.74) 



and where /?(0 is a still unknown function of time. To derive the corresponding mag- 
netic induction we insert (15.74) into (15.69). One can convince oneself readily that 
only the y-component of this equation is non- vanishing. 



dEz _ _ dBy 
dx dt 



(15.75) 



Since the left-hand side of this equation is proportional to cos {kx) it suggests that we 
put By proportional to cos(A:jc). This leads us to the ansatz 
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By = q{t) — cos{kx) (15.76) 

c 

where we have included the factor 1/c for later convenience. This factor gives p and q 
the same physical dimension. Inserting (15.76) into (15.75) yields 

^ = cop (15.77) 

dt 

where we have used the abbreviation 



o)^ck, (15.78) 

Since A: is a wave number and c a velocity, co in (15.78) is a circular frequency. Inserting 
E (15.74) and B (15.76) into (15.70) yields 



-^=-coq. (15.79) 

dt 



When we differentiate (15.77) with respect to time and eliminate p from it by means of 
(15.79), we obtain 




+ (o^q = 0 . 



(15.80) 



This equation is the well-known equation of a harmonic oscillator with circular fre- 
quency cu. Equations (15.77) and (15.79) can be written in a very elegant form by intro- 
ducing the Hamiltonian 

je=\(o{p^+q'^). (15.81) 



With its aid we can write (15.77) and (15.79) in the form 



dq _ OJf 
dt dp 

dp _ _ 8^ 
dt dq 



(15.82) 

(15.83) 



Quite evidently we are dealing here with the Hamiltonian equations of a harmonic 
oscillator. This then allows us to identify p with the momentum and q with the 
coordinate of an harmonic oscillator. With this identification we have the key in our 
hands to quantize the electromagnetic field. This is done by a purely formal analogy. In 
Sect. 9.4 we saw how to quantize the motion of the harmonic oscillator. Here we want 
to do exactly the same. To put this analogy between the harmonic oscillator and the 
electromagnetic field on firm ground we show that (15.81) is identical with the 
energy of the electromagnetic field mode. According to electrodynamics, the field 
energy in the volume given by 
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[/= I— — dh . 

2 V Ho / 



(15.84) 



By inserting (15.74) and (15.76) into the energy expression (15.84) we obtain 



U = —N^eo 
2 






[p^ sin^(Arx) + cos^(Arx)] dx 



(15.85) 



Under the assumption of periodic boundary conditions, the integration over x can easi- 
ly be performed so that we are left with 

U=irNho{p^ + q^). (15.86) 

We find exactly the same function of p and q as occurring in (15.81). However, this 
identification now allows us to determine the still unknown normalization factor N. 
Comparing (15.86) with (15.81) we obtain 



N = 



CO 



^0 



2 

r 



(15.87) 



Now let us return to the quantization problem. We wish to utilize the analogy 
between the Hamiltonian (15.81) and that of the harmonic oscillator. It is convenient to 
use its Hamiltonian in the form 



ifi(o(n^+^^). (15.88) 

The equivalence of (15.81) with (15.88) is achieved by putting 

p = ]/nn, q = \/h^ (15.89) 

SO that the Hamiltonian (15.81) acquires exactly the same form (15.88). Here, however, 
we known what the quantum version looks like. We have to replace 77 by the operator 
0/i6(^ exactly in analogy to Sect. 9.4. By exploiting that analogy further we introduce 
creation and annihilation operators by 



1 

W 



9.^ 



+ 0 = 6^ 



-^(—+^ = b 

1 ^ \ 9 ^ / 

or, solving for p and q 



(15.90) 

(15.91) 



p = \ 



2 



(15.92) 
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The creation and annihilation operators b ^ and b obey the commutation relation 

bb^-b^b=\. (15.94) 

By using (15.92) and (15.93), we can express the free fields E and B by means of these 
operators in the form 



E, = \{b^-b) 



= (b^+b) 



-Nsin{kx) 



COS (kx ) . 

2 c 



(15.95) 



(15.96) 



The normalization factor is given by 
N = 



CO 

£o 



2 

~r 



1 



£Oi«o = — r- 



(15.97) 



With the transformations (15.90) and (15.91), the Hamiltonian (15.88) can be ex- 
pressed by the creation and annihilation operators exactly as in Sect. 9.4. 



je=h(o{b^b + \). (15.98) 

We leave it as an exercise to the reader to convince himself that this Hamiltonian could 
be also derived by inserting (15.95) and (15.96) into (15.84). For a number of problems 
dealing with the interaction between electrons and the electromagnetic field we need the 
vector potential A. A is connected with the magnetic induction by 

B = cutIA. (15.99) 



In our book we choose the “Coulomb gauge” 



divA = 0 . 



(15.100) 



For B in the form (15.96) the relations (15.99) and (15.100) are fulfilled by 



A^=0, Ay=0, A^=-(b^ + b) 



hcono 

2 



— — sin(A:x). (15.101) 

'V k 



Let us summarize the above results. When we quantize the electromagnetic field, the 
electric field strength, the magnetic induction, and the vector potential become 
operators that can be expressed by the familiar creation and annihilation operators b ^ , 
b of a harmonic oscillator. The total energy of the field also becomes a Hamiltonian 
operator of the form (15.98). 

Since the normalization of waves in infinite space provides some formal difficulties 
(which one may overcome, however), we shall use a well-known trick. We subject the 
wavefunctions 
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exp(ift^-r) (15.102) 

to periodic boundary conditions. In order to apply the above formalism in particular to 
the Lamb shift, two changes must be made: 

1) Instead of using a single (standing) wave, the electromagnetic field must be written 
as a superposition of all possible waves. 

2) Instead of standing waves, the use of running waves has some advantages. 

Since the derivation of the corresponding relations does not give us any physical 
insight beyond what we gained above, we immediately write down these relations. 



E(r) = le, 

k 



[i*A exp(i*A • r) - ibt exp ( - i*v /•)] 

2eo^ 



B{r)=Y,kxex 

A 



— [i*A exp(iftA • r) - \bt exp ( - ift;i • /•)] 

I'V 



The individual expressions have the following meaning: 
A index labeling the individual waves 
ex vector of polarization of wave 
a>x circular frequency 
kx wave vector of wave 

k = k/\k\. 

The vector potential A reads 



(15.103) 

(15.104) 



(15.105) 



A = Zex 

A 



-[bxtx^(\kx-r) + bx exp(-i*A-r)] 

2cua£o ^ 



The operators bx, bx again obey the commutation relations 



(15.106) 



II 

1 


(15.107) 


bxbx’-bx’bx=0 


(15.108) 


btbt-btbt = 0. 


(15.109) 



The Hamiltonian operator reads 



Afield = E h(Ox{bt bx+ i) . (15.110) 

As usual, the SchrOdinger equation is obtained by applying the Hamiltonian operator 
to a wavefunction, which we call : 



Afield ^ = E0 , 



(15.111) 
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We shall determine the wavefunction and energies in the exercises. For what follows, 
we need only the ground state defined by = 0 for all A, and the states which are 
occupied by a single light quantum (photon). 

<Px=bt0o (15.112) 

with quantum energy 



£;,= hoJx( + Eo) . 



(15.113) 



It has the zero point energy 



Eo=l\hco^, (15.114) 

A 

This expression is infinite, but it is also unobservable and therefore dropped. 

15.5.2 Mass Renormalization and Lamb Shift 

We shall now treat the interaction of a hydrogen atom with the quantized light-field. 
The interaction is brought about by the vector potential^ occurring in (14.12). Since ^ 
is small in the present case, we shall retain only terms linear in A, and assume 
div^ = 0. But in contrast to that former case, the fields are not externally given, but 
become now by themselves variables of the system. Therefore we must not only writer 
in the form (15.106), but we must also add to the Hamiltonian ^ei+ that of the 
quantized field, i.e. Jffieid- Thus the Schrodinger equation to be solved reads: 



(^el+^field+^int)!P=£^ 


(15.115) 


where 

2 

jfei J+F(r) 

2 mo 


(15.116) 


'Afield = D bOJxbx bx 
A 


(15.117) 


Am =—A -p = — 

mo mo A 


/ — - — - [bx exp(i*A • r) + exp (i*A • r)]p 


where 




p = ^ grad . 


(15.118) 



1 



In order to solve the Schrodinger equation (15.115), we shall apply perturbation 
theory, where 



+ ‘Afield (15.119) 

serves as unperturbed Hamiltonian and ^int as perturbation. The eigenfunctions of 
^0 are products of an eigenfunction of ^eb i*e. W and of an eigenfunction of Jffieid) 
i.e. 0. We shall abbreviate the set of quantum numbers n, l,m of y/ by n. In the fol- 
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lowing we shall be concerned with the vacuum state and one-photon states = 
bx <?o- Identifying the index v of with («,0), or {n, A) we may write the unperturbed 
wavefunction as 



<Pv = ¥n (f) where 


(15.120) 


1-H 

O 

o 

II 


(15.121) 


The corresponding energy levels are 




^v,tot~^n ~ 0) and = E^+ hco;^ (/c = A) . 


(15.122) 



For the perturbation theory we need the matrix elements of which read in bra and 
ket notation: 



H 



1 _ 
Itv- 



Wn^K 



— lex 
rriQ X 



— [bx exp (i*A • r) + bx exp ( - i*^ • r)]p 

ioix%y 




which may be rearranged as 



(15.123) 



ti\. = —l 



mo X 



h 

loixe^'f' 



[ <n I exp (i • P I « ' X I I > 



+ <n\QX^{-ikx-r)ex-p\n''){<P^\bx\^K')] . (15.124) 

As we know from the quantized harmonic oscillator, 

<^«l*A^I^,c> = <^k|^aI^/c> = 0 • (15.125) 

Therefore the perturbation energy in first order vanishes. Starting from the vacuum as 
unperturbed state, i.e. k' = 0, we further have 



<^,cI*aI^o> = 0 , <^K\bx\^o) = SxK (15.126) 

In other words, a single photon X may be generated. We assume that the atom sits at the 
origin and that the extension of the electronic wave function is small compared to the 
wavelength of the light waves exp[i*;i • r]. This allows us to ignore this factor in the ma- 
trix elements. In this way is reduced to (with k = X\) 



{n\ex-p\n') . 



Zj\ — tt\ _ ^ 

niQ 

The expression for the perturbed energy in second order therefore reads 

g(2) ^ y 

n’.xEl-El-hoxx ■ 



(15.127) 



(15.128) 
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In quantum electrodynamics, the processes which lead to are usually visualized as 
follows: 

An electron is in its initial state n, and there is no photon. Then a photon of kind A 
is emitted, [creation operation bt in (15.124) and (15.128)!] and the electron goes into 
the state n' . Finally the photon is reabsorbed [annihilation operator bi in (15.124) and 
(15.128)!] and the electron returns to its state n. This process is described by the follow- 
ing “Feynman diagram” (Fig. 15.10). 

X 



n n' n 

Fig. 15.10. Example of a Feynman diagram: virtual emission and reabsorption of a photon 



In the following we have to distinguish the different field modes more carefully. To 
this end we replace the general mode index A by the wave vector k and an index j indi- 
cating one of the two directions of polarization. Furthermore we use the relation be- 
tween frequency and wave number 



co;^ = cOk= ck 



(15.129) 



While we initially start with waves normalized in a volume we will eventually go over 
to an integration which is done by the rule 



1 

~r 



A 



I k 
;=1,2 (27T)^ 



(15.130) 



Using (15.127, 129) and (15.130) we may cast (15.128) into the form 

(2?:)^ ImleQ co^ n'j El-E%-h(X)k 

For further evaluation we split the integral over /r-space into one over the space 
angle Q and one over the magnitude of k, i.e. k, 

\d^k = \k^dk\dQ . (15.132) 



We then first perform the integration over the space angle and sum up over the two di- 
rections of polarizations. Since the evaluation is purely formal we immediately write 
down the result 

\dQY,\in\ej-p\n')\^ = An\\{n\p\n')\^. (15.133) 

j 

This leads us to the following result for the self-energy 

^ 3 1 (u r/cu I 4'^ ^ I' . 

{Inf 7,mleoC^ o «' El~E%-hco 



(15.134) 
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A detailed discussion of the sum over n ' reveals that this sum certainly does not vanish 
more strongly than l/o>. We thus immediately recognize that the integral over oj in 
(15.134) diverges which means that the energy shift is infinitely great. This seemingly 
absurd result presented a great difficulty to theoretical physics. It was overcome by 
ideas of Bethe, Schwinger and Weisskopf which we will now explain. 

When we do similar calculations for free electrons we again find an infinite result, 
which can be seen as follows. We repeat the whole calculation above but instead of ei- 
genfunctions y/„ of the hydrogen atom we use the wave functions of free electrons 

y/n(r)->^Wp = Nexp(ip-r/h) (15.135) 

Note that in this formula p is a usual vector whereas p occurring for instance in (15.127) 
and (15.131) is the momentum operator (h/i) grad. Instead of matrix elements, which 
were between the eigenstates of the hydrogen atom, we now have to evaluate matrix 
elements between plane waves. We immediately obtain 



<p'\p\p> = N^\exp{-\p'-r/h)pexp(ip-r/h)d^r (15.136) 

and 

<P'\p\P> = pSp,p' . (15.137) 

Furthermore, we have to make the substitution 

El-E%^Ep-Ep. (15.138) 

but we immediately find 

{Ep-Ep)dp^p. = 0 (15.139) 



on account of (15.137). By putting all the results together we obtain the self-energy of a 
free electron in the form 



£ 



( 2 )_ 
P ~ 



1 






00 



Y, do) . 
0 



(15.140) 



We notice that the self-energy of a free electron of momentum p is proportional to p^. 
Equation (15.140) can be interpreted as giving rise to a shift in the mass of the electron, 
which can be seen as follows: The energy of the free electron without interaction with 
the electromagnetic field (“bare electron”) reads 



Ep = p^/{2 mo) 



(15.141) 



In it, mo is the “bare” mass. The energy shift just calculated is 



AEp^ 



1 2e^ 2 ?^ 

tP • 

(2n) 3moBoC o 



(15.142) 
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Thus the total energy reads 
2 

Ep + AEp = ^— (15.143) 

2 mo 

While the mass of the “bare” electron neglecting electromagnetic interaction is mo, tak- 
ing this interaction into account the electron mass is mo. Note that in this type of con- 
sideration one uses mo and not fho in (15.142). This follows from the “renormalization” 
procedure we will describe now. 

Since we always make observations on free electrons with the electromagnetic inter- 
action present, (15.143) must be just the expression we normally write down for the en- 
ergy of a free electron where mo is the observed mass. Thus we can make the identifica- 
tion 



1 _ 1 
mo mo 



1 



3 mlsoc^ 



00 4 

\dcjo =-z— -25 
0 mo 



(15.144) 



where 25 is merely an abbreviation of the last term of the middle part of (15.144). The 
electromagnetic self-energy can be interpreted as a shift of the mass of an electron from 
its “bare” value to its observed value mo. This shift is called renormalization of the 
mass. 

The argument used in renormalization theory is now as follows. The reason that the 
result (15.134) is infinite lies in the fact that it included an infinite energy change that is 
already counted when we use the observed mass in the Hamiltonian rather than the 
bare mass. In other words, we should in fact start with the Hamiltonian for the hydro- 
gen atom in the presence of the radiation field given by 

- + . (15.145) 

2 mo Ansor 



Then using (15.144) we can rewrite ^ as 






2 mo 



+ 

An^or 



(15.146) 



Thus if we use the observed free particle mass in the expression for the kinetic energy 
(which we always do) we should not count that part of that produces the mass 
shift, i.e. we should regard 

+ (15.147) 

as the effective interaction of an electron of a renormalized mass mo with the radiation 
field. 

Returning then to the calculation of the Lamb shift, we see that to first order in 
e^/hc we must add the expectation value of the second term in (15.147) to (15.134) in 
order to avoid counting the electromagnetic interaction twice, once in mo and once in 
• Thus more correctly the shift of the level n is given by 
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(2)^ 



1 



le^h 

3Wo£oC^ 



oo 

j codoj 
0 



<A?'|p|/2>r <Az|p |^7> 



El-E^n'-hCD 



+ - 






(15.148) 



The second term under the integral in (15.148) can be brought into a form similar to the 
first term under the integral by means of the relation 



{n\p^\n)= Y. {n\p\n’){n’\p\n) (15.149) 

n' 

In order not to interrupt the main discussion we will postpone the proof of this relation 
to the exercises. Using (15.149) and (15.148) we find after a slight rearrangement of 
terms 



g(2)^ 



£-0 
^ n ^ n 



TtV ^ V 3 'L\<n'\p\n)\^ ]d0J - 0 — — 

(In) 3mo£o<^ o En-En'-hco 



(15.150) 



We note that the integral over co is still divergent, however, only logarithmically. This 
divergence is not present in a more sophisticated relativistic calculation. Such a calcula- 
tion yields a result quite similar to (15.150), but with an integrand falling off more rap- 
idly at high frequencies > m^c^. We can mimic the result of such a calculation by 

cutting off the integral at = m^c^/h. The integral can be immediately performed and 

yields 



.(2). 



1 






(In) 'imQEQhc 



E 

n' 



\{n'\p\n)\\El.-E^„)\n 



rripc 

_ pQ 
^ n' ^ n 



(15.151) 



where we have neglected \E^„-E^„'\ compared to rtiQC^. The further evaluation of 
(15.151) requires some formal tricks which are purely mathematical. Namely, 



In 



mpc^ 

(E'i.-El) 



is replaced by an average 



In 



\moC 






Clearly, we may now rewrite (15.151) in the form 



,( 2 )_ 



2e^ 



(2n)^ 'impEphc^ 



In- 



\mpc 



{\El.-El\y - 



l\{n'\p\n)\\El,-El) . (15.152) 



To simplify (15.152) further we use the relation 



Z \<n' \p\n)\\E% -El) = -\{n\[p,[p, 

n' 



(15.153) 
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which will be proved in the exercises. The double commutator on the right-hand side 
can be easily evaluated. We assume .#ei in the form 

--^A + V{r) . (15.154) 

IrriQ 

We readily obtain (cf. Sect. 9.3) 



[Px, ^el] 



h dV(r) 
i dx 



(15.155) 



and in a similar fashion 

[p,lp,^c\]]= . (15.156) 

Using for V the Coulomb potential of the electron in the hydrogen atom, 
V= -eV(4 we can readily evaluate the right-hand side of (15.156). Using a 

formula well known from electrostatics (potential of a point charge!) we find 



A 



1 

r 



-4nS(r) 



(15.157) 



where d(r) is Dirac’s delta function in three dimensions. Using this result and the defi- 
nition of bra and kets (see Problem 9.19), we readily obtain 



(15.153) = — (n A 

2 \ 47reokl 



2 £o 



\\xi/n(r)\^d{r)d^r 



(15.158) 



and making use of the properties of the (5-function 

(15.158) = -^ I V/„(0)|^ . 

2eo 



(15.159) 



We are now in a position to write down the final formula for the renormalized self- 
energy shift by inserting the result (15.156) with (15.153) and (15.159) into (15.152). We 
then obtain 



£(2) = _J_ In 

{Inf 3mlsohc^ <|£’°--£'°|> 



^k«(0)|' 

2£o 



(15.160) 



To obtain final numerical results we have to calculate numerically the average as well as 
I y/„ (0) p. For the hydrogen atom 1 1 //„ (0) p is well known and is nonzero only for 5-states. 
The average value was calculated by Bethe for the 2S level. Inserting all the numerical 
values we eventually find = 1040 megacycles. According to these considerations 
a shift between an S and a P level must be expected. Such a shift was first discovered 
between the 2 Si/i and 2Pi/2 level of hydrogen by Lamb and Retherford. 
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At first sight, it may seem strange that it is possible to obtain reasonable results by a 
subtraction procedure in which two infinitely large quantities are involved. However, it 
has turned out that such a subtraction procedure can be formulated in the frame of a 
beautiful theory, called renormalization, and such procedures are now a legitimate part 
of theoretical physics giving excellent agreement between theory and experiment. Un- 
fortunately it is beyond the scope of our introductory book to cover the details of these 
renormalization techniques. 



Problems 



15.1 A harmonic oscillator with mass m, charge e and eigenfrequency cu is subjected 
to a constant electric field. Calculate the wavefunctions to the first and second approxi- 
mations by perturbation theory, and compare the result with the exact solution and en- 
ergy values (see Problem 9.13). 

As an example, one might choose the perturbation of the « = 1 and n = 2 levels. 

Hint: Use the Z? Z? formalism in perturbation theory and the results of the correspond- 
ing problems from Chap. 9. 



15.2 The rotational motion of a two-atom molecule or of a rotating atomic nucleus 
can be approximately described by the Schrodinger equation 



Mrl 



dd^ 



<p{.d) = E(i>{e), 



where M is a mass and is an “effective” distance. Let the wavefunction 0(^) be 
periodic in the angular coordinate 0: 0(9-\-2n) = 0(0). What are the wavefunctions 
and energy values of this system? Then assume the system is perturbed by the addition- 
al potential a cos (20). Calculate E and 0 in this case using perturbation theory with 
degeneracy. 



15.3 Prove that (15.112) fulfills (15.111) and verify (15.113). 

Hint: Insert (15.112) into (15.111), use the commutation relations (15.107), (15.109) 
and Z?A^o = 0 for all A ’s. 

15.4 Prove that the general wavefunction 

0 = ^ . {bt,r>^ 00 (*) 

fulfills (15.111) and determine E. 

Hint: Insert (*) into (15.111), use the relations 

bAb:r-(b:)b, = nb"-^d,,, , 

(15.109) and *a^o = 0 for all A’s. 
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15.5 Prove<«|p^|rt>= E {n\p\n' ){n' \p\n) . 

n' 

Hint: Write the bra-ket as integrals ly/'^py/„ dxdydz and use the completeness 
relation 

1 Wn’ir)w^’(.r') = d{r-r') {**) 

n' 

where d is Dirac’s function. 

15.6 Prove l\{n'\p\n)\\E%-E’i)= -\{n\[p,[p, 

n' 

Hint: Start from the expression 
I {n\p\n') {n'\p\n)El. 

n' 

and use the fact that the are eigenfunctions of Use the completeness relation 
(**). 

15.7 Show that the Lamb shift can be understood qualitatively by the assumption that 
the zero-point fluctuations of the field (= set of harmonic oscillators!) cause a shift of 
the potential energy of the electron. 

Hint: Write the electron coordinate as 
r' = r + s 

where r corresponds to the unperturbed motion, and s to the field fluctuations. 
Expand 

iWnir) V{r')y/nir)) 

in the first two powers of s, where <5 > = 0 and convince yourself, that an expression 
analogous to (15.158) results. 
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16.1 Symmetries and Selection Rules 

16.1.1 Optical Matrix Elements 

Selection rules and symmetry considerations play a basic role in modern physics. This 
holds for the atomic shells, the nucleus, elementary particles and for many other areas. 
We shall present only a small but typical sample here, taking up the perturbation theory 
discussed in Sect. 15.2. We began with the unperturbed wavefunctions 0^. The per- 
turbation Hamiltonian in one example contained the dipole moment ez. In the follow- 
ing, we shall first choose the x coordinate instead of the z coordinate. We became 
acquainted with matrix elements of the form 

Hl„ = \rmir)ex0„(r)dV (16.1) 

as important parameters. As we observed at that time, there are a number of realistic 
cases in which matrix elements vanish when the indices m and n are equal, on the basis 
of symmetry alone. We shall now examine these cases. 



16.1.2 Examples of the Symmetry Behaviour of Wavefunctions 

As an unperturbed wavefunction, let us consider that of a particle in a box (Sect. 9.1) 
or of a harmonic oscillator. We recognise (see Figs. 16.1, 2) that these wavefunctions 
are either “symmetric” or “antisymmetric”. 





Fig. 16.1. Example of a symmetric wavefunction. 
When reflected through the i/a axis, j/a is converted 
into itself 



Fig. 16.2. Example of an antisymmetric ^diyefwnciion. When the wavefunction 
is reflected through the (/a axis it is transformed into a function with the same 
magnitude but the opposite sign at each point 
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A symmetric wavefunction is converted into itself when every x is converted to -x. In 
an antisymmetric wavefunction, this process reverses the sign of the function. We shall 
now show two things: 

1) How this symmetry property of the wavefunction can be derived directly from 
the Schrodinger equation, without solving it explicitly, and 

2) how these symmetry properties can be used to prove that 

= (16.2) 

We first consider the symmetry properties of the Hamiltonian and select the 
harmonic oscillator as an example. If we replace x by -x, we obviously obtain 

x^^{-xf = x^. (16.3) 

x^ is thus unchanged when x is replaced by -x, or in other words, is “invariant” 
with respect to the transformation 

x-^-x. (16.4) 

Since the potential energy of the harmonic oscillator is proportional to x^, this 
invariance property naturally applies to the oscillator potential itself: 



F(-x)= F(x). 



(16.5) 



In analogous fashion, one can show that the second derivative with respect to x is also 
invariant with respect to the transformation (16.4): 



dx^ d(-x)^ 



dx^ 



(16.6) 



We can now assume in general that for a certain one-dimensional problem, the 
Hamiltonian is invariant with respect to the transformation (16.4). If we take the 
appropriate Schrodinger equation 

jf(x)y/(x) = Ey/(x) (16.7) 

and replace each x by -x, we naturally obtain 



Jf(-x)if/(-x) = Eii/(-x), (16.8) 

However, since the Hamiltonian is supposed to be invariant with respect to the trans- 
formation (16.4), we can replace Jf{-x) by jf{x) in (16.8), 

Jf{x) if/(-x) = Eij/(-x ) . (16.9) 

This tells us that if i//(x) is an eigenfunction of (16.7), i//( -x) is also an eigenfunc- 
tion of (16.7). We now make the simplifying assumption that there is only a single 
eigenfunction for the energy E. The word “single” should be taken with a grain of salt, 
in that eigenfunctions can differ from one another by a constant numerical factor. 
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As we see from (16.7) and (16.8), the eigenfunctions [j/{x) and belong to the 

energy E. They can differ at most by a constant factor, which we shall call a, due to our 



assumption above. We thus have the relation 

y/{-x) = aij/ix) . (16.10) 

If we replace x by -x on both sides of (16.10), the relation becomes 

\l/{x) = ay/i-x) . (16.11) 

We now replace y/(x) on the right side of (16.10) by y/(-x) according to (16.11), 
obtaining as the overall result 

\!/{-x) = ay/{x) = a^y/{-x) . (16.12) 

Since we know that y/ does not vanish identically, we can divide both sides by y/{ -x), 
yielding 

a^=l, (16.13) 

or, taking the square root, 

a=±l. (16.14) 

Inserting this result now into (16.10), we obtain the relation 

y/^-x) = ± \f/{x ) . (16.15) 



This, however, is just the relation we were searching for. As may be seen in Figs. 16.1 
and 16.2, the plus sign means that the wavefunction is symmetric, and the minus sign 
means that it is antisymmetric. We thus see that from the symmetry of the Hamil- 
tonian, it follows automatically that the wavefunctions will have a particular symmetry 
behaviour. 

The argumentation which we have just carried out may be immediately generalised 
to three dimensions, by utilising consistently the replacement 

x-^r (16.16) 

and by replacing the transformation (16.14) by the transformation 

r^-r. (16.17) 

If the Hamiltonian is invariant with respect to (16.17), it follows, in complete analogy 
to the derivation of (16.15), that 

y/(-r)=±y/{r), (16.18) 

The transformation behaviour described by (16.18) is denoted as parity. If the positive 
sign is valid, one speaks of positive (or even) parity; if the negative sign holds, the 
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parity is negative (or odd). Correspondingly, the oscillator wavefunctions of Figs. 
9.9a, b have positive parity for even « = 0,2, 4, ... and negative parity for odd 
/7 = 1 , 3 , 5 , . . . . 

Up to now we have assumed that only a single wavefunction (up to a constant 
factor) belongs to E, or, in other words, that the wavefunctions are not degenerate with 
one another. It can be shown by the detailed theory that even in the case of degeneracy, 
the degenerate wavefunctions may be so defined that here, too, the relation (16.18) is 
fulfilled, as long as the Hamiltonian is invariant with respect to the transformation 
(16.17). 

Before we show by means of symmetry arguments that (16.2) is valid, let us 
consider another example of a symmetry. Keeping in mind the hydrogen atom 
problem, which is three dimensional, we investigate symmetry with respect to rotations 
in three-dimensional space; as a concrete example, we consider a rotation by the angle 
0 around the z axis. We assume, as is for example fulfilled in the hydrogen atom, that 
the Hamiltonian remains unchanged when we rotate the coordinate axes through the 
angle 0 = 0i about the z axis (Fig. 16.3): 

(G0+0i) = Jf(r,0) , (16.19) 

where we have not shown the constant angle 9 as an argument. 




Fig. 16.3. Rotation of the coordinate system around 
the z axis by an angle 0^ 



If, in the corresponding Schrodinger equation, we replace 0 everywhere by 0+ 0j, 
we obtain 

Jf(r,0-h0i)^(r,0-l-0i) = £’^(r,0-h0i) , (16.20) 

or, using rotational invariance, 

^(r,0)|^(/*,0-h0i) = £’i//(g0+0i) . (16.21) 

Since the wavefunction which occurs in (16.21) must be an eigenfunction of the original 
Schrodinger equation, we must have (without degeneracy) in analogy to the example 
treated before. 



^(r,0+0i) = y/(r,0 ) . 



(16.22) 
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Here the constant a can, as we have explicitly indicated, be a function of the angle of 
rotation 0i. We should like to determine this functional dependence exactly. For this 
purpose, we write (16.22) again for a second rotation angle 02- 

V^(r, 4> + <t>2) = </») • (16.23) 

We imagine now that 0 in (16.22) has been replaced by (^+02- This gives us 



y/{r, 0 + 01 + 02> = Vir, 0 + 02) , 
or, using (16.23) on the right side, 
y/{r, 0 + 01 + 02) = i//(r, 0) . 



y/(r, 0 + 01 + 02) = + 02 !//(/•, 0) 

(Fig. 16.4). 




(16.24) 



(16.25) 

in (16.22) on both sides of the equation 
to 

(16.26) 



Fig. 16.4. Addition of rotations: two rotations which 
are carried out one after the other may be replaced by a 
single rotation 



On the other hand, we could have replaced <pi 
by 01 + 02» which would have led immediately 



We now compare (16.26) with (16.25). We see immediately that the a’s are 
connected to one another by the relation 

+ 02 ^01^02* (16.27) 

It may be shown mathematically in an exact way that (16.27) can only be fulfilled by 

a0 = e^'”^ (16.28) 

where m is still an unknown parameter. (Addition of arguments leads to multiplication 
of the functions ^ exponential function.) Now, however, we recognise that every wave- 
function must go into itself when we rotate the coordinate system completely around in 
a circle, that is, when we rotate by an angle of 2 n. From this fact, we obtain directly the 
requirement 

(16.29) 

This may be fulfilled by requiring m to be a positive or a negative integer. 
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To utilise this result, we employ (16.22) again, setting 0 = 0 in it: 

y/(r , 0 + 0y) = a^. y/(r, 0) . (16.30) 

If we leave off the indexy and use (16.28), we finally obtain the relation 

0) = e*'”<*v/(r,0) , m integral . (16.31) 

Thus we have found that the wavefunction ij/ is dependent on the angle 0, and in a way 
which is in perfect agreement with (10.82), the result which we obtained for the 
hydrogen atom. 

As this example indicates, extremely general conclusions about the structure and the 
transformation behaviour of wavefunctions may be drawn from symmetry considera- 
tions. Naturally, one can also consider rotations around other axes, and rotations 
around various axes may also be added to give new rotations. The corresponding trans- 
formation behaviour of the wavefunctions is treated by what is called the theory of 
representations of the rotation group, which, however, is beyond the subject matter of 
this book. We hope, however, that the reader has gained a feeling for the way in which 
the symmetries of the original problem (i.e. of the Hamiltonian) can lead to a certain 
transformation behaviour of the wavefunctions. 



16.1.3 Selection Rules 

We have already met selection rules several times in this book. As we have seen, the 
coupling of an atom to external fields, especially to the radiation field (light), produces 
transitions between the electronic states in the atom. However, these transitions only 
occur (at least in first order) when the corresponding matrix element of the external 
field, H^„, is nonzero. If it is zero, the transition is forbidden; we thus obtain selection 
rules for the occurrence of transitions. We shall now show, using several simple 
examples, how it follows from the symmetry properties of the wavefunctions that 
certain matrix elements are identically equal to zero. This is valid independently of the 
form which the wavefunctions may happen to have. 

As a first example we consider an integral which results from leaving out the factor 
e from the matrix element (16.1). If we replace the functions 0„ and 0„, by the wave- 
function ij/ which we are now considering, we have 

+ 00 

/= I y/*{x)xii/{x)dx , (16.32) 

— 00 



where we have chosen a one-dimensional example (for an intuitive picture of this in- 
tegral, see Fig. 16.5). We rename the variable of integration x, in that we replace x by 
-x: 

(16.33) 

The value of the integral naturally remains the same. However, the following 
individual changes are made in the integral (16.32): 

/= I y,*(-x){-x)y/{-x)d{-x) . 

+ 00 



(16.34) 
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Fig. 16.5. Explanation of the integral (16.32). The inte- 
gral is obtained by multiplying the value of the func- 
tion X {- ■ -) by the value fo the function \y /\^{ — ) at 
each point along the x axis and then integrating the pro- 
ducts over all values of x. As can be seen from the 
diagram, the product for each value of x has the same 
magnitude as the product for -x, but the opposite sign. 
When added, each pair has a sum of zero, and thus the 
integral vanishes 



By exchanging the limits and accordingly changing the sign of the differential, we obtain 
j y/*{-x)(-x)y/(-x)dx . (16.35) 

— c» 

Now we make use of a symmetry property of the wavefunction, namely the trans- 
formation relation (16.15). The transformation (16.4) leaves y/* y/ invariant, so that the 
integral (16.35) differs from the original integral (16.32) only with respect to its sign. 
We thus obtain the relation 

/= -/, (16.36) 

which, of course, can only be satisfied by 

7=0. (16.37) 

This result is enormously important. It shows us that we have found the value of the 
integral to be 0 without having done any integrating. We have used only symmetry rela- 
tionships. 

What would have happened if we had calculated a matrix element with different 
indices m and ai? If the perturbation operator were again jc, we would then have 
obtained the following: if and 0^ have the same parity, the result is still zero. The 
integral can only have a non-zero value if the parity of and 0^ are different. Here we 

have the simplest example of a selection rule. 

As a second example, which is particularly important for atomic shells, we will 
again consider a dipole matrix element, but now between eigenfunctions of the 
hydrogen atom. We shall examine a matrix element of the form 

Iz=Wh,mir)Zy/n’J',m’(r)(^^ ■ (16.38) 

In the following, it is useful to employ polar coordinates, in which (16.38) has the form 

h = Vlt,m(r, 0, ((>)rcosdy/„',i',mir, d, 0) . (16.39) 

In order to determine when the matrix element is identical to zero, we again make 
use of symmetry relationships. This time, however, we consider rotations about the 
z axis. We allow a rotation through the angle 0 q. This converts (16.39) into 
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= (16.40) 

as one can easily see from the transformation property expressed in (16.31). The left 
side must naturally remain I^. Equation (16.40) can be satisfied in two ways. Either 
= 0, or /j + 0, but in the latter case m must be equal to m This is another example of 
a selection rule. If the perturbation operator contains the dipole moment in the z direc- 



tion, a matrix element can only be different from 0 if m = w '. 

We shall now derive a selection rule for the x and y components of the dipole 
moment, by considering the integrals 

Ix=\dV Wnj.mir, 6, 6, <t>) (16.41) 

and 

Iy = \dV wh.mir, 9, (t>)y Wn'j’Mr, 9, (p) . (16.42) 

We shall multiply the second line by i and add it to the first, and also express x and y in 
spherical polar coordinates. We then obtain 

Ix+^Iy = \dVy/*,^^rsm9t^‘^y/„'j’^m' ■ (16.43) 

If we again carry out a rotation through the angle 0o about the z axis, we obtain, in 
complete analogy to the preceding case, 

Ix+ '\Iy = exp[-i(m-l -m')(pQ\{Ix+ '\Iy ) . (16.44) 

From this it immediately follows that 

lx+ily=0 for m4=m' + l (16.45) 

and similarly by subtraction of (16.41) and (16.42) 

Ix-‘\Iy = 0 for (16.46) 

In summary, if m^m' \ and m =1= m' - 1, then 

4=0 and 4 = 0. (16.47) 



Here we have discovered another selection rule: 4 or 4 can have non-zero values only 
when m = m' + 1 or m = m' - 1. 

It can be derived from the rules (16.40, 45 and 46) which transitions in an atom can 
be induced by a radiation field. If the radiation field is polarised in the z direction, only 
transitions with m = m' are possible. However, if it is polarised in the x or the y direc- 
tion, the transitions with 

m = m'±l (16.48) 

are possible. These are the n ox a transitions which were introduced in Sect. 13.3.2. 
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The selection rule of the angular momentum of the electron, 



/ = /'±l, (16.49) 

can be derived from the matrix element (16.38) on the basis of considerations which are 
similar in principle to the above, but require somewhat more extensive mathematics. 
The basic matrix element occurs just when light, or more exactly, light quanta, are 
absorbed or emitted (see also Sect. 16.1.4). Since the total angular momentum of 
electron and light quantum is conserved, it can be deduced from (16.49) that a quantum 
of light has the angular momentum (spin) h. 

We can summarise what has been learned in this chapter as follows: The invariance 
properties of the Hamiltonian operator lead to certain transformation properties of the 
wavefunctions. From the transformation properties of the wavefunctions and of the 
perturbation operator (in the present case this is the dipole operator) it can be 
rigorously determined which matrix elements vanish identically and which can in 
principle differ from zero. However, these considerations give no information about 
the magnitude of the matrix elements, and it can happen that still other matrix elements 
vanish for other reasons. As it turns out, symmetry considerations allow quite exact 
predictions, and it is therefore not surprising that such considerations, appropriately 
generalised, have a basic role in the physics of elementary particles. 



16.1.4 Selection Rules and Multipole Radiation* 

In this chapter we have been primarily concerned with the matrix element 

(16.50) 

We met this earlier, in Sect. 15.3, where we examined the effect of a light wave on an 
atom. Conversely, the matrix element (16.50) is also found to apply to the production 
of light by atomic transitions. Within the limits of this introduction we cannot go into 
details, but we will describe the basic ideas. As we know from classical electro- 
dynamics, an oscillating dipole generates electromagnetic waves. The dipole is mathe- 
matically described by the dipole moment 

P= -er{t) (16.51) 

where r is the vector from the positive charge to the oscillating negative charge. As 
usual, we assume that the oscillation is purely harmonic, r{t) = sino»t. The dipole 
moment (16.51) then appears in the Maxwell equations for electromagnetism as a 
“source term”. (For the more interested reader, we can formulate this somewhat more 
exactly: In the Maxwell equation curl// =j + dD/dt, the polarisation P appears in D if 
matter, in the present case atoms, is present. In the classical case, this can be expressed 
by (16.51) or, if several atoms are present, by a summation of several expressions of the 
form of (16.51).) The question now arises whether there is a quantum theoretical 
analogue to this source term of the form of (16.51). Let us recall the translation table in 
Sect. 9.3.4, and accordingly, assign to the classical observable “dipole moment 
-er{t)” an operator -er and, finally, the expectation value 



\y/*(r,t)(- er) t//(r, t)dV . 



(16.52) 
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What does the wavefunction y/{r, t) mean here? To obtain an insight, let us imagine 
an atom with two energy levels and Ei and the related wave functions 0 \(r) and 
(pi{r). To generate an oscillation, we form a superposition in the form of a “wave 
packet”, 

H/{r,t) = , (16.53) 

1/2 1/2 

where the factors \/\/2 serve to normalise y/ir). Let us substitute (16.53) in (16.52) and 
multiply out the individual terms. The result is a sum over expressions of the form of 
(16.50), in which m and n can take on the values 1 and 2. Let us assume, as above, that 

\ 0 xer 0 xdV and \ 0 ier 02 dV 

vanish. Then (16.52) takes the form 

\\(t>nr){-er)(D 2 {r)dV-^-''^‘+cx.\ (16.54) 

where we have also introduced the abbreviation a> = {E 2 ~Ex)/h, The expectation value 
of the dipole moments thus actually oscillates as a classical dipole and generates the 
corresponding classical electromagnetic field. In this way the relationship between a 
dipole matrix element and dipole radiation becomes clear. 

Now as we know from classical physics, radiation fields are generated not only by 
oscillating dipoles, but also by other oscillating charge or even current distributions. A 
loop conducting an electric current acts like a magnetic dipole. If we vary the current in 
the loop, for example in a sine wave, the magnetic dipole moment oscillates and 
generates “magnetic dipole radiation”. Can such a radiation field also be generated by 
an atomic transition? This is in fact possible, but the matrix element (16.52) or (16.54) 
is no longer sufficient to describe it mathematically. Instead we must base it on a more 
exact theory of the interaction between light and electrons. 

With the exception of constants, the corresponding part of the Schrodinger 
equation is 

epA, (16.55) 

where p = -\hS 7 is the momentum operator and A is the vector potential of the light 
field. We have already encountered the interaction expression (16.55) in Sect. 14.1, e.g. 
in (14.9). There the vector potential referred to a constant magnetic field; here it 
describes the electromagnetic field. If we express^ in terms of plane waves (in complex 
notation), we have instead of (16.55) expressions of the form 

ep-et*’’ (16.56) 

in which e is the polarisation vector of the light wave with the wavenumber vector k. 

The matrix element which now appears in perturbation theory, 

e\4>n(>‘)p-ee'’‘'''0mir)dV, (16.57) 



1 



Here and in the following, c.c. indicates the complex conjugate of the preceding expression 
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takes the place of (16.50). Since the wavelength X is generally large compared to the 
extent a of the wave function 0, we can assume k - r = lnr/X< \ and expand the 
exponential function 

e‘*''»l + i*-r+... . (16.58) 

If we substitute this in (16.57), the first term is 

\^t{r)ep-e,t>m{r)dV. (16.59) 

In the special case that e is parallel to the x axis and n = \,m = l,'fit have 

\(ptir)epx 02 (i')dV. (16.60) 

As is demonstrated exactly in quantum mechanics, (16.60) is identical to 

-imo)\(pf{r)ex<p 2 {r)dV where co = {Ei- E\)/ h , m: particle mass, (16.61) 

or, except for a numerical factor, the familiar dipole matrix element. There are cases of 
atomic transitions, however, where (16.61) vanishes. (These are “forbidden electric 
dipole transitions”.) Then the matrix element derived from the second term in (16.58), 

\e\ 0 tir){p -e){k- r) 02 ir)dV, (16.62) 

becomes important. This can also be rearranged (which will not be demonstrated here) 
and becomes (except for the factor u>me) 

1 0f (r) [-er{k-r)](l> 2 ir)dV. (16.63) 

To establish the connection with classical physics, let us think of the integral over r 
being replaced by a discrete sum over points r,, each with the charge {-ej). Then 

(16.63) ^ E ( - ej)rj(k ■ rj) . (16.64) 

j 

However, just this sum appears as the “source term” in the classical theory of electro- 
magnetic fields (it appears as the “Hertz vector”). It is known from this theory that the 
sum represents a superposition of the electric quadrupole moment and the magnetic 
dipole moment, the oscillations of which produce the corresponding radiation fields. 
(In the case of the magnetic dipole, use is made of the fact that r = i ojr.) Thus we 
recognise a one-to-one correspondence between quantum theory and classical physics 
with regard to the radiation properties of atoms and classical sources, just as the Bohr 
correspondence principle requires. 

The radiation fields of electric and magnetic dipoles are shown in Figs. 16.6 and 
16.7. The theory sketched above, in which the radiation field is treated classically, 
according to the Maxwell equations, but the “source term” is introduced as a quantum 
mechanical expectation value, is often referred to in the literature as the “semiclassical” 
theory. 
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z 




Fig. 16.6. Radiation field for electric dipole radiation. The 
dipole oscillates In the z direction. The lines connect 
elements of the electric field E which have the same phase. 
The radiation field of magnetic dipole radiation is formally 
the same as that for electric dipole radiation if we replace 
the electric dipole moment p by the magnetic dipole moment 
p and simultaneously make the substitution E-^B and 
B^E 



t = 0 




Fig. 16.7. Visualisation of the time dependence of the 
radiation of a Hertzian dipole. Tis the period of oscillation 




16.2 Linewidths and Lineshapes 

As we showed in the Schrodinger theory, electrons can assume certain energy states in 
the atom. If an electron is in an excited state, it can go from there to an energetically 
lower state by emitting a quantum of light. The result of this is that the lifespan of the 
excited state is no longer infinite. In classical electrodynamics, it is shown that the 
energy of a Hertzian oscillator ( = oscillating dipole) decays exponentially with time. 
Measurements on excited atoms show (Fig. 16.8) that their radiation intensity also 
decays exponentially. Such behaviour is to be expected, according to the corre- 
spondence principle (Sect. 8.11). In fact, the quantum mechanical treatment of radia- 
tion, which cannot be given here in detail, shows that the number N of excited atoms 
decreases according to 

N = (16.65) 

1/(2 y) is the time in which the number decays to N/t. to is called the lifetime of the 
state. In the sense of the statistical interpretation of quantum mechanics, (16.65) refers 
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Fig. 16.8. The decay times of excited atomic states can be measured, for 
example with the arrangement shown here. Atoms in a neutral beam are lifted 
into defined excited states by irradiation with electrons of the appropriate 
kinetic energy or with laser light. A movable device for measuring radiation 
intensity along the line of flight is used. The velocity of the atoms is known, so 
that the radiative decay of the excited state during the flight through the 
chamber can be used to calculate the decay time, or lifetime, of the excited 
state 



to the quantum mechanical average which describes the behaviour of many atoms, i.e. 
of the ensemble. For a single atom, the emission occurs at a completely random time. 
In analogy to (16.65), the amplitude of the emitted light decreases exponentially. The 
radiation field amplitude F therefore has the form (Fig. 16.9) in complex notation 

/T(^) = 7T(,(e-J''+‘"o'+c.c.) , t>0. (16.66) 




Here cuq is given by {Eo~Ef)/h, where Eg and F^are the energies of the initial and final 
atomic states, respectively. Fq is a real amplitude. 

We can imagine that the excitation occurs at time t = 0, so that for / < 0 there is no 
light wave. If we use a spectrometer to examine the emitted light, we find that this light 
is composed of monochromatic components, i.e. waves of the form 



c(a>) e'"' (in complex notation) , 
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where a> = 2nc/X (A: wavelength, c: velocity of light). The radiation field amplitude 
(16.66) can be represented as a superposition of such waves: 

F{t) = — ] c{(o)e^‘dco. (16.67) 

2 7T - 00 

This decomposition is known in mathematics as a Fourier transformation. It is shown 
in Fourier theory that the Fourier coefficients c(co) are given by 

c((d) = T . (16.68) 



The intensity of monochromatic light with frequency oj is then given by 

\c(co)\\ (16.69) 

For an exponentially decaying light field (16.66), the spectral distribution is thus 



c(co) 



1 



1 



i((Oo-(o)-y i(-coo-(o)-y 



(16.70) 



Since (coo~r^) ^(<yo+w) and y<^((«o+<y). (cu>0), the second term in (16.70) is 
much smaller than the first and can be neglected. The intensity distribution is then 



|c(cu)|^ = Fo 



1 



(co-a>of+y^ 



(16.71) 




Fig. 16.10. Lorentzian line 



A spectral line with the intensity distribution given in (16.71) and shown in Fig. 
16.10 is called a Lorentzian line. Its width at half height is given by the atomic decay 
time to according toy = 1/(2 to). The linewidth introduced here is also called the natural 
linewidth. Typical numerical values are 10"^- 10"^ s (10"^ s :^5 • lO'^^cm*"^ ^ 15 
MHz), and thus y = 10*- 10^ s“‘. 

It is intuitively reasonable that the electron orbitals of gas atoms can also be dis- 
turbed by collisions between the atoms. This constantly causes changes in the resulting 
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light emissions, resulting in a line broadening which is called collision broadening. If 
atoms are incorporated into solids, they interact constantly with the lattice oscillations, 
which again disturb the electron orbitals and lead to line broadening. In these cases, the 
atoms are all completely identical, and the resulting line broadening is “homo- 
geneous”. “Inhomogeneous line broadening” occurs when individual atoms, which 
were originally identical, become distinguishable through additional physical condi- 
tions. For example, atoms in solids can occupy different types of positions within the 
lattice, so that the energies of individual electrons are differently displaced. These dis- 
placements often lie along a continuum and their intensities assume a Gaussian dis- 
tribution. 

Another example of inhomogeneous line broadening is Doppler broadening in 
gases. According to the Doppler principle, the frequency of light emitted by an atom 
moving toward the observer with velocity v differs from that emitted by an atom at 
rest by 

0 ) - £Uo(l ± v/c ) , (16.72) 



where c is the velocity of light. The frequency is increased when the atoms are moving 
toward the observer, and decreased when they are moving away. Let us now imagine a 
gas at thermal equilibrium. According to the Boltzmann distribution (2.8) the number 
of gas atoms whose velocity components v in the direction of the observed light wave lie 
in the interval (v,v-\-dv) is given by 




(16.73) 



N is the total number of atoms, k is the Boltzmann constant, T the absolute tempera- 
ture and Wo the atomic mass. (The difference between (2.7) and (16.73) is based upon 
the fact that (2.7) refers to the three dimensional motion of the gas atoms, while (16.73) 
refers to one velocity component v.) Since, according to (16.72), a frequency displace- 
ment is associated with the velocity v, we obtain an intensity distribution given by 



I(co) = const • exp 



moC^(a>o-ajf 

IkTcol 



(16.74) 



as is shown in Fig. 16.11. The total linewidth at half height is given by 



zlcoD = -^^(21n2--^ 

c V 
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According to (16.75), the Doppler width of a spectral line, Aco^ = 2nA Vd is thus pro- 
portional to both the frequency and the square root of the temperature. For the yellow 
D line of sodium, at r= 500K, the Doppler width is ZIvd= 1700 MHz, or 
(1/A)d = 0.056 cm"^ For optical spectral lines, this width is generally significantly 
greater than the natural linewidth y [see (16.71)]. 

In contrast, the Doppler broadening of microwave or radiofrequency transitions 
between excited atomic states (which can be studied by double resonance methods, see 
Sect. 13.3.7) is generally smaller than the natural linewidth. At frequencies v < 10^^ Hz, 
the Doppler width at the same temperature is, according to (16.75), <10"^ Hz. The 
spectral resolution of such double resonance methods is thus no longer limited by 
Doppler broadening. 

Doppler broadening also occurs in atoms which are incorporated into solids and are 
vibrating there at high temperatures. 

As we saw in Chaps. 8 and 12, there are a number of interesting line shifts and split- 
tings. In order to measure these exactly, the linewidth must be small compared to these 
shifts or splittings. Therefore, in Chap. 22 we shall discuss methods by which line 
broadening, and in particular Doppler broadening, can be avoided, thus permitting the 
experimenter to carry out Doppler-free spectroscopy. A simple method of reducing the 
Doppler broadening consists of cooling the source of the atomic beam which is being 
investigated. In order to reduce collisional broadening, one must ensure that the mean 
time between two collisions of one atom with other atoms is longer than the mean life- 
time of the excited state. This is done by reducing the pressure in the experimental 
apparatus. 
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17.1 The Spectrum of the Helium Atom 

The simplest many-electron atom is the helium atom. In the ground state, its two 5 
electrons exactly fill the innermost shell with the principal quantum number n = \. 
There is no room for more electrons in this shell, as we shall see below. 

In the excited state, one electron remains behind in the half-filled first shell, while 
the other is excited into a higher shell. Therefore we have 

electron 1 in state n = \ / = 0 , and 

electron 2 in state «>1, 1=0 . 

In the previous chapters, we treated the spectra of atoms in which the quantum 
numbers of only one electron were sufficient to characterise the terms. Other electrons 
were either not present - as in the H atom - or they were all in so-called closed shells 
or closed subshells. That means, as we shall also see presently, that they make no con- 
tribution to the total angular momentum or magnetic moment of the atom. 

The experimentally derived term scheme for helium (Fig. 17.1) is similar in some 
respects to those of the alkali atoms. It differs from them, however, in that there are 
two term systems which do not combine with each other, as if there were two kinds of 
helium atoms: a singlet and a triplet system. These names come from the fact that in the 
singlet system, all the terms are single, while in the triplet system, they are generally 
split into triplets. 

What is important for practical applications, such as gas discharges and lasers, is 
that both the lowest state of the triplet system (2^5 in Fig. 17.1) and the second-lowest 
state of the singlet system (2^5 in Fig. 17.1) are metastable in the helium atom. 
“Metastable” means that the lifetime of the system is long compared to 10"^ s, which is 
the usual lifetime of a state which can be emptied by an allowed optical transition. An 
atom which is excited to one of these states can thus radiate its energy of about 20 eV 
only in a time that is long compared to 10"^ s. 

Helium in the singlet state is also called parahelium. Unlike the alkali atoms, it has 
no fine structure. All its lines are single. The lowest term is given the symbol 1 ^S. Here 
the first 1 stands for the principal quantum number, the superscript 1 for the 
multiplicity (here singlet), and the letter S for the total orbital angular momentum, 
which in this case is zero. The higher terms are 2 ^S, 2 ^P, 3 ^S, 3 3 ^D, etc. From the 

lack of fine structure, one can conclude that the spins of the two electrons are 
antiparallel, and add vectorially to a total spin S = 0. The same holds for the magnetic 
moment, jus = 0. Here the upper case letters indicate quantum states which result from 
the coupling of many (here 2) electrons. 

Triplet helium, which in contrast to the singlet system has fine structure, is called 
orthohelium. Its lowest level is 2 ^S. Here the 2 stands for the excited electron with 
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Fig. 17.1. Term scheme of the He atom. 
Some of the allowed transitions are in- 
dicated. There are two term systems, be- 
tween which radiative transitions are forbid- 
den. These are the singlet and triplet systems. 
The transitions in the singlet system span an 
energy range of 25 eV, while those in the 
triplet system span only 5 eV. The threefold 
splitting of the triplet terms (except for ^S) 
due to spin-orbit coupling is not indicated 
here. The indices 0, 1, and 2 on the term 
symbol 1,2 stand for the 3 possible quan- 
tum numbers 7=0, 1 , 2 



n = 2, the superscript 3 for the multiplicity (triplet), and the letter S for L = 0. Unfor- 
tunately, both the total spin quantum number and a term with the total orbital angular 
momentum L = 0 are represented by the same letter, S. The reader must be aware of 
this difference. 

In this system, the spins of the two electrons are parallel to each other. The quan- 
tum number of the total spin is s^ + S 2 = S = 1 . The magnetic moment jUs = jUs^ + Ms 2 is 
different from zero. The resulting total spin has three possible orientations with respect 
to an internal magnetic field Bi which is coupled to the orbital angular momentum of 
the electrons. The spin-orbit coupling resulting from this leads to a triple fine structure 
splitting of the terms which have non- vanishing total orbital angular momentum. 

The spectrum of parahelium lies mostly in the ultraviolet, while that of orthohelium 
is in the infrared and visible. Combinations between the systems are not observed, i.e. 
there are no optical transitions between the singlet and triplet systems. If one compares 
corresponding configurations in the two non-combining systems, one discovers con- 
siderable differences in energy, especially for the low quantum numbers. The 2 ^Sq state 
lies about 0.8 eV above the 2 ^Si, and the 2 state about 0.25 eV higher than the 2 ^P 2 
(the notation in this sentence is explained in Sect. 17.3.2). This energy difference 
between the singlet and triplet configurations is a result of the differences in the 
electrostatic interactions of the two electrons in parallel and antiparallel spin orienta- 
tions. It is also called the symmetry energy, because it arises from the difference in 
the average distance between two electrons with symmetrical and antisymmetrical 
wavefunctions (see Sect. 17.2). For P states, this can also be seen in Fig. 17.9. 
More information about the theory will be given in Chap. 24, “the quantum theory 
of chemical bonding”; in particular, see Fig. 24.7. 




17.2 Electron Repulsion and the Pauli Principle 



299 



17.2 Electron Repulsion and the Pauli Principle 

Compared to the one-electron atom, the new factor in the helium atom is the repulsive 
interaction of the two electrons. For the total binding energy of the He atom, one must 
therefore write 




Ane^rx Amari Am^rxi 



Nucleus - 


Nucleus - 


Electron 1 - 


electron 1 


electron 2 
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'-2 


hi 



(17.1) 



The repulsion energy of the two electrons naturally depends on the «, / states they 
occupy, because the spatial distribution of the electrons depends on the quantum 
numbers. This repulsion energy thus lifts the / degeneracy to a considerable degree. 

The Schrodinger equation of this relatively simple two-electron problem cannot 
be solved exactly. The potential no longer has spherical symmetry and a separation 
into radial and angular parts is no longer possible. As a first approximation, in the 
independent-particle model, one neglects the third term and sets the total energy 
equal to the sum of two H-atom terms. The binding energy is then 



RhcZ^ \ _ ( R he Z^ \ 
)\ \ /2 



where the indices 1 and 2 refer to the two electrons. One would then expect the energy 
of the ground state to be 

^He = 2-(-54.4)eV= - 108.8 eV. 

The experimental value is distinctly different from this. The total work of removing 
both electrons is 79 eV, 24.6 eV for removing the first electron (ionisation of the He to 
the singly charged positive ion He"^) and 54.4 eV for the removal of the second electron 
(ionisation of the singly charged He"^ to the doubly charged positive ion He^'^). The 
second value is the same as one would expect from a comparison with the hydrogen 
atom. There the ionisation energy is 13.6 eV. For helium, one would expect the energy 
to be four times as great, because the nucleus is doubly charged. The work of removing 
the first electron, however, is much smaller. The model for the binding energy must 
therefore be refined by taking into account the energy of the interaction of the two elec- 
trons. We shall present an approximation process in Sect. 19.4. 

The observation that the helium atom has a 1 state but not a 1 ^5 state was the 
starting point for the Pauli principle {Pauli 1925). In its simplest form, it says: 

The electronic states of an atom can only be occupied in such a way that no two 

electrons have exactly the same set of quantum numbers. 

The electrons must therefore differ in at least one quantum number. In addition to the 
orbital quantum numbers n, I and /??/, the spin quantum number 5 or m^ is taken into 
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account here. An atomic state with a certain set of spatial quantum numbers {n, I, 
Ml) can thus be occupied by at most two electrons. Their spin (projection) quantum 
numbers ms = ±1/2 must be different. In the 1^5 configuration, both electrons would 
have exactly the same quantum numbers, as we shall show in detail below. For 
more details see Sect. 19.4. 

This principle is the generalisation of the previously stated empirical rule for all 
atoms with more than one electron: There is always a unique ground state which has 
the lowest principal quantum number. It has the highest multiplicity which is compati- 
ble with the principal quantum number. We discuss this point further in Sect. 19.3. 

17.3 Angular Momentum Coupling 

17.3.1 Coupling Mechanism 

We have already learned that in the one-electron system, the individual angular 
momenta / and s combine to give a resultant angular momentum j . There is a similar 
coupling between the angular momenta of different electrons in the same atom. We 
have already inferred, as an important result of empirical spectral analysis, that the 
total angular momentum of filled shells is equal to zero. This follows directly from the 
observation that the ground state of all noble gas atoms is the ^Sq state. In calculating 
the total angular momentum of an atom, it is therefore necessary to consider only the 
angular momenta of the valence electrons, i.e. the electrons in non-filled shells. These 
angular momenta are coupled by means of magnetic and electric interactions between 
the electrons in the atom. They combine according to specific quantum mechanical 
rules to produce the total angular momentum / of the atom. These quantum rules are 
those which have already been discussed. The vector model provides insight into the 
composition of the angular momentum. There are two limiting cases in angular 
momentum coupling: the LS or Russell-Saunders coupling, and jj coupling. 

17.3.2 LS Coupling (Russell-Saunders Coupling) 

If the interactions (Sj • //) between the spin and orbital angular momenta of the indi- 
vidual electrons i are smaller than the mutual interactions of the orbital or spin angular 
momenta of different electrons [coupling (//-/y) or the orbital angular 

momenta // combine vectorially to a total orbital angular momentum L, and the spins 
combine to a total spin 5. L couples with S to form the total angular momentum /; 
cf. Fig. 13.11. 

Fig. 17.2. Coupling of the orbital angular momenta 
of two electrons and I 2 to the total angular 
momentum L. Coupling to D, P and 5, correspond- 
ing to X = 2, 1, and 0. The picture is schematic. 
Exact addition of the angular momenta is accom- 
plished as shown in Fig. 12.16 

For a two-electron system like the He atom, the resulting behaviour is shown in 
Fig. 17.2. The orbital angular momentum L of the atom is the sum of the two electron 
orbital angular momenta 




D term P term S term 



L = h + h, 



(17.3) 
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For the absolute value of L it again holds that \L \ = ]/L(L + \)h with the quantum 
number L, for which the following values are possible: 

L = /i + / 2, /1 + / 2 -I.../ 1 -/ 2 . where 

The quantum number L determines the term characteristics: 

L = 0, 1 , 2 . . . indicates S,P,D ... terms . 

It should be noted here that a term with L = 1 is called a P term but this does not neces- 
sarily mean that in this configuration one of the electrons is individually in a /» state. 
For optical transitions the following selection rules hold: 

Zl / = ± 1 for the single electron , 

AL = 0,±\ for the total system . 

AL = Q means here that the quantum states of two electrons change simultaneously, 
and in opposite directions. This is only possible when the coupling is strong, which is 
the case in heavy atoms. 

Furthermore, for the total spin, 

5 = 51 + ^2 with I5| = l/S(5+I)fi. (17.4) 

The total spin quantum numbers S can take one of two values here, either 

5=172 + 1/2 or 5=1/2-172, i.e. 5 = 0 or 5=1. 

The selection rule for optical dipole radiation is Zl 5 = 0. This means that combina- 
tions between states with different total spins are not allowed, or in other words, that 
spin flipping is not associated with optical dipole radiation. 

Finally, the interaction between 5 (or its associated magnetic moment and the 
magnetic field 5^, which arises from the total orbital angular momentum T, results in a 
coupling of the two angular momenta L and 5 to the total angular momentum /: 

J = L + S, \J\ = \/J(,J^\)h. (17.5) 

The quantum number J can have the following values: 

for S = 0 , J = L , 

for S=l, y = L + l,L,T-l, 

in which case all terms are triplets. 

The individual angular momenta are combined according to exactly the same quan- 
tum rules with which we became acquainted in Chap. 13. 

In the general case of a several-electron system, there are 2S-(-l possible orienta- 
tions of S with respect to L, i.e. the multiplicity of the terms is 2S-I- 1 (if S<L). As an 
example. Fig. 17.3 shows the possible couplings in the case of S = 1, L = 2. 
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S=1 



L=2 



]=3 




Fig. 17.3. Combination of spin (5) and orbital (L) 
angular momenta to form the total (/) angular 
momentum. If L = 2 and 5=1,/ can assume the 
values 3, 2, or 1. This is a schematic representation. 
Figure 12.16 explains the method of exact addition 
of the angular momenta in the vector model 



Using the helium atom as an example, we shall show once more which atomic terms 
can be derived from the given electron configurations. The lowest states of the helium 
atom have the following term symbols and quantum numbers: 

- If both electrons are in the lowest shell, the electron configuration is 1 5 1 5 , or in 
the common notation 1 5^. They then have the following quantum numbers: 

n^ = ri2=\, /i = /2 = 0, 5 i = 1/2, 52=1/2. 

The resulting quantum numbers for the atom are then either 

L = 0 , S = 0 for = / = 0 , 

the singlet ground state ^Sq; or 

L = 0, S=1 for = /=1, 

the triplet ground state Only the singlet ground state is actually observed. The 
triplet ground state is forbidden by the Pauli Principle, because in it the two 
electrons would not only have the same quantum numbers n and /, but they would 
also have the same spin orientation m^, 

- However, when one electron remains in the shell with « = 1 , while the other is raised 
into the state with n = 2, that is in the electron configuration 1 ^ 25 , we have the 
following quantum numbers: 

«i = l, «2 = 2, /i = /2 = 0, 5 i = 1/2, 52=1/2. 

This yields either Z = 0, S = 0, /= 0, the singlet state ^Sq; or L = 0, S = 1, /= 1, 
the triplet state ^Si. Both states are allowed and are observed. 

- In the same way, the states and term symbols can be derived for all electron 
configurations. More on this subject is to be found in Sect. 19.1. 

We finally arrive at the term scheme with the allowed optical transitions by taking 
into account the selection rules. Intercombination lines with AS=i are forbidden, 
because there is no spin flipping with optical dipole radiation. This is the reason for the 
existence of non-combining term systems like that in Fig. 17.1. 

The complete nomenclature for terms or energy states of atoms, which we have 
already used, is then 
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One first writes the principal quantum number n of the most highly excited 
electron, which is called the valence electron. The superscript is the multiplicity 2S + 1 . 
It is followed by the alphabetical symbol S, P, D... for the total orbital angular 
momentum L; the subscript to this symbol is the quantum number J for the total 
angular momentum of the atom. 

For many-electron systems, this must be expanded over the possible multiplicities as 
shown below: 



For two electrons 


s = o 


5=1 






Singlet 


Triplet 




For three electrons 


5=1/2 


5 = 3/2 






Doublet 


Quartet 




For four electrons 


5 = 0 


5=1 


5 = 2 




Singlet 


Triplet 


Quintet 


For five electrons 


5=1/2 


5 = 3/2 


5 = 5/2 etc. 




Doublet 


Quartet 


Sextet 


According to the rules 


for coupling of angular momenta, which have been treated in 



this section, the possible atomic states can now be easily derived from known electron 
configurations. We shall only explain a simple example here. In the configuration 
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Fig. 17.4. Term diagram for the nitrogen atom (only simple terms, no J 
splitting). Nitrogen as a doublet and a quartet system. The electronic con- 
figuration of the valence electrons is given at the top 



Fig. 17.5. Term diagram for the carbon atom (only simple terms, no J split- ► 
ting). Carbon has a singlet and a triplet system. The configuration of the 
valence electrons is given at the top 
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nsn's, i.e. with two non-equivalent 5 electrons, the total orbital momentum is neces- 
sarily L = 0, because l^ = l 2 = 0. The spins of the two electrons can be either parallel or 
antiparallel. This means that the total spin quantum number S = 1 or S = 0. In either 
case, the total angular momentum J = S. The possible terms of the configuration 
ns n's are therefore the triplet term and the singlet term ^Sq. 

We shall discuss other examples in Chap. 19, when we know somewhat more about 
the energetic order of these possible terms. 

To make the above somewhat more clear. Figs. 17.4 and 17.5 show the Grotrian 
diagrams for the nitrogen and carbon atoms. Here, only those terms are taken into 
account which result from the excitation of a single electron, the so-called valence elec- 
tron. Of the remaining electrons, only those need be considered which are in non-closed 
subshells. The remaining electrons form the spherically symmetric core of the electron- 
ic shell. The terms and term symbols resulting from LS coupling and from the quantum 
states of the individual electrons can now be immediately understood, and can be 
derived from the electron configuration, which is also given. We shall return to these 
many-electron atoms in Chap. 19. 



17.3.3 jj Coupling 



The second limiting case for coupling of electron spin and orbital angular momenta is 
the so-called jj coupling, which occurs only in heavy atoms, because the spin-orbit 
coupling for each individual electron increases rapidly with the nuclear charge Z. 

In jj coupling, the spin-orbit interaction (//• si) for a single electron is large com- 
pared to the interactions (//• Ij) and (sr sj) between different electrons. This type of 
coupling is shown schematically in Fig. 17.6b. For comparison, LS coupling is shown 
in Fig. 17.6a. 





Fig. 17.6. a) Schematic representation of LS 
coupling between two electrons, b) jj coupling 
between two electrons. In jj coupling, the 
angular momenta L and S are not even defined 



In jj coupling, the angular momenta of individual electrons couple according to the 
pattern 

h + S\-^j\ , h+S2-^j2y 

and so on to give individual total angular momenta j. These then combine vectorially to 
give the total angular momentum J of the atom. Here / = Z y / and 

\j\ = \/j{j+\)h. 

In this type of coupling, the quantum number J arises from a generalised quantum 
mechanical vector model. A resultant orbital angular momentum L is not defined. 
There are therefore no term symbols S, P, D, etc. One has to use the term notation 
Uuji) etc., where j are the angular momentum quantum numbers of the individual 
electrons. It can easily be shown that the number of possible states and the / values are 
the same as in /S coupling. 
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Fig. 17.7. Simplified energy diagram of the mercury atom as an example 
of a heavy atom with term symbols corresponding to L5-coupling. The 
wavelengths [A] of a few of the more important lines are given. The 
strongest line in the spectrum of a mercury lamp is the line at 2537 A which 
results from the intercombination of the and the 6^P| states. Inter- 
combination between terms of different multiplicity is strictly forbidden in 
light atoms. In heavy atoms it is possible. This line corresponds to the 
energy loss of 4.9 eV in the Franck-Hertz experiment, Sect. 8.8 
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Fig. 17.8. Photographically recorded spectrum of a low-pressure mercury 
lamp: segment between 2500 and 5800 A. Due to the superposition of the 
different series in the spectrum, which are shown in the energy diagram 
Fig. 17.7, a series structure cannot be immediately recognised in the spec- 
trum of a heavy, several-electron atom, as might be inferred from the 
energy diagram 



Purely jj coupling is only found in very heavy atoms. In most cases there are inter- 
mediate forms of coupling, in such a way, for example, that the intercombination be- 
tween terms of different multiplicity is not so strictly forbidden. This is called inter- 
mediary coupling. The most prominent example of this is the strongest line in the spec- 
trum of mercury high-pressure lamps, A = 253.7 nm (compare the energy diagram in 
Fig. 17.7 and the photograph of the spectrum of a low-pressure mercury lamp in 
Fig. 17.8). In the frequently used high-pressure mercury lamps, the intensity distribu- 
tion of the spectrum is different from that of a low-pressure lamp. The 253.7 nm line is 
relatively most strongly emitted, in addition broadened, and reabsorbed. This is an 
inter combination line between the singlet and triplet systems. The selection rule for 
optical transitions is Zl/ = 0, ± 1 , and a transition from / = 0 to / = 0 is forbidden. An 
example for the transition from LS to jj coupling is shown in Fig. 17.9. 

LS jj 




C Si Ge Sn Pb 

2p3s 3p4s 4p 5s 5p6s 6p7s 



Fig. 17.9. Transition from LS coupling in light 
atoms to jj coupling in heavy atoms in the series 
C-Si-Ge-Sn-Pb. In carbon the 
and - ^P 2 distance are 20 and 40 cm “ \ while 
the ^Pi-^P 2 distance is 1589 cm~^ The term 
nomenclature is that of LS coupling for carbon, 
and of jj coupling for lead. The quantum sym- 
bols of the two outermost electrons are given. 
The symbol (3/2, 1/2)| means j^ = 3/2,72 = 1/2, 
J= 1 
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17.4 Magnetic Moments of Many-Electron Atoms 

Having calculated the total angular momentum of many-electron atoms, we shall now 
also calculate the total magnetic moment. The treatment exactly follows that of the 
one-electron system in Chap. 13. In the case of LS coupling, the magnetic moment is 
composed of 



HL+Hs = fij- 



Here fii is antiparallel to L and ns is antiparallel to S, but because of the different g 
factors of orbital and spin magnetism, // ^and / are not antiparallel to each other. Their 
directions do not coincide; instead the total moment j precesses around the direction 
of J. As was mentioned in Chap. 13 and illustrated in Fig. 13.9, the observable 
magnetic moment is only that component of juj which is parallel to /. For this 
component as was shown in Sect. 13.5, 



\iMj)j\ - 



3/(7+l) + S(S+l)-L(T-H) 
21/7(7+1) 



• /Mb = 1 / 7 ( 7 + 1)//B 



(17.6) 



with the Lande factor 



7(7+ 1) + S(S+1) -7(7 + 1) 
27(7+1) 



(17.7) 



In one chosen direction z, the only possible orientations are quantised and they are 
described by whole or half-integral values of the quantum number m j, depending on 
the magnitude of 7. 

iHj)j,z = (17.8) 

with 

my = 7, 7- 1 , ... - 7 . 

The contents of this chapter are the quintessence of years of spectroscopic work; 
measurement of spectra, setting up of term schemes, determination of quantum num- 
bers, and so on. Measurements in magnetic fields have also been an essential tool. If 
one applies the insights discussed in Chap. 13 to many-electron atoms, one can deter- 
mine the magnetic quantum numbers of the atomic states from measurements of the 
splitting of spectral lines. The same considerations which were discussed in Chap. 13 on 
the behaviour of atoms in magnetic fields apply to many-electron atoms. Here too, the 
normal and anomalous Zeeman effects and the Paschen-Back effect are important 
limiting cases. The LS coupling can be broken in sufficiently strong magnetic fields, 
and in very strong fields, even the jj coupling breaks down. 

The atomic magnetic moments can also be measured by determining the macro- 
scopic material constant magnetic permeability, p,, according to the equation 
B = {H is the magnetic field strength). Details are given in H. Haken and H. C. 

Wolf, Molecular Physics and Elements of Quantum Chemistry, Sect. 3.6. 
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17.5 Multiple Excitations 

Let it be only briefly mentioned here that observed spectra can be made much more 
complicated by multiple excitation processes, such as those in which several electrons 
are involved and change their states in the atom. This is especially likely in systems 
where there is strong mutual interaction between the electrons. In ionisation processes, 
for example, it is possible that a second electron is simultaneously excited. Excitation 
energies higher than the ionisation limit can be obtained if a light quantum simul- 
taneously removes one electron (ionisation) and raises a second one to a discrete excita- 
tion level. This makes the analysis of heavy-atom spectra much more difficult. 



Problems 

17.1 The energy levels of helium-like atoms with one electron in the ground state 
(« = 1) and the other in an excited state (n>\) can be expressed as 

n 

This expression is based on the assumption that the ground state electron completely 
shields one unit of nuclear charge. Discuss the plausibility of the expression. Calculate 
the energy levels for helium with n = 2, 3 and 4 and compare them with the experi- 
mental results. Why does the accuracy of the above expression for E increase as n 
increases? 

17.2 Show that the sum ^ (2/-h 1) over all possible values of J for a given pair of 
quantum numbers L and S is equal to the product (2L + 1)(25+ 1). What is the physi- 
cal meaning of this product? 

17.3 Discuss a two-electron system with a 2p and 3, 3d electron for the case of jj 
coupling and show that the number of possible states and their total angular momen- 
tum J are the same as in LS coupling. 

17.4 a) Ignoring spin-orbit coupling, determine the number of possible terms of an 
excited carbon atom with the electronic configuration \s^2s^2p3d, 

b) Calculate the effective magnetic moment of an atom in the ground state with the 
configuration \s^2s^2p^3s^As^3d^, assuming that L has the largest possible value 
consistent with Hund’s rule (Sect. 19.2) and the Pauli principle. 

c) Calculate the ground state of the atoms with the electronic configurations Ad 5 
(Y) or Ad^5s^ (Zr). [The closed shells are not given. L is determined as under b).] 

d) The manganese atom (Z = 25) has in its ground state a subshell which is exactly 
half-filled with 5 electrons. Give the electronic configuration and the ground state of 
the atom. 
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17.5 a) Calculate the maximum components of the magnetic moments in the direction 
of the magnetic field for vanadium (V), manganese (^S) and iron CD), if beams of 
these atoms are split into 4, 6 or 9 parts in a Stern-Gerlach experiment. 

b) What is the term symbol of the singlet state with a total splitting of v = 1 .4 cm “ ^ 
in a magnetic field = 0.5 tesla? 




18. X-Ray Spectra, Internal Shells 



18.1 Introductory Remarks 



Up to this point, the discussion has been concerned with the energy states and spectra 
of the most weakly bound electrons. In the lighter atoms, these are usually the outer- 
most or valence electrons. If we now turn to x-ray spectra, we shall be concerned with 
the energy states of electrons in inner shells. We shall see, however, that x-ray spectra 
can for the most part be treated as one-electron spectra, although they belong to many- 
electron atoms. 

Historically, it was the x-ray spectra which led to the theory of the shell structure of 
the atom (Kossel 1914). At present, however, we shall assume that the shell structure is 
familiar as we turn to the x-ray spectra. 




X-rays are usually generated by irradiating an anode, which is often called the anti- 
cathode, with fast electrons (Fig. 18.1). We point out here that x-rays may also be pro- 
duced by electron impact or collisional excitation of free atoms, and thus independent- 
ly of solid-state influences. The x-rays are detected by photographic plates, film, count- 
ing tubes, or more recently, by semiconductor detectors. The latter are made as silicon 
or germanium diodes. Absorption of x-rays in the “space-charge” zone releases charge 
carriers. These can be measured as in an ionisation chamber. Pulse-height analysis per- 
mits the apparatus to be used both to measure the energy of the x-ray quanta and as 
a simple spectrometer. For higher resolution spectroscopy and wavelength measure- 
ments, one still uses the crystal spectrometer described in Sect. 2.4.5. One can also ob- 
tain a rough estimate of the wavelength of x-rays by measuring their “hardness” - 
their ability to penetrate solids. 

18.2 X-Radiation from Outer Shells 

By “x-rays”, we usually mean electromagnetic radiation (light) which has a wavelength 
shorter than that of ultraviolet light - though there is no sharp boundary. The range is 
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usually considered to be 0.1 to 10 A , which corresponds to quantum energies of 1 - 100 
keV. The x-ray region is attained according to the series formula (Sect. 8.2) 

v = /?Z^(l/«^-l/n'h 

for hydrogen-like atoms, i.e. atoms with only one electron, if the nuclear charge is large 
enough. For Z = 20, the quantum energies are already 400 times as large as the energies 
of corresponding transitions in the hydrogen atom with Z = 1 . It is generally not 
possible to generate such “Balmer series” for highly ionised atoms in the laboratory, 
but these spectra can be observed in stellar atmospheres. Recently, such experiments 
have also become feasible through the use of particle accelerators; cf. Sect. 8.6. The 
Balmer series of was observed in the spectral range from 15 to 35 keV, and the 
Lyman series near 100 keV. 

18.3 X-Ray Bremsstrahlung Spectra 

If an anticathode is bombarded with electrons which have passed through an accelerat- 
ing voltage Vq, x-rays are generated. Spectral analysis of these reveals that 

- there is always a continuum, the x-ray bremsstrahlung (Fig. 18.2), 

- and under certain conditions, there is in addition a line spectrum, the characteristic 
spectrum (Fig. 18.3). 

If the intensity is plotted against the frequency, see Fig. 18.4, rather than the 
wavelength (as in Fig. 18.2), and if care is taken to ensure that the intensity distribution 
of the x-ray spectrum is not modified by reabsorption in the anticathode (cf. 
Sect. 18.6), the bremsstrahlung spectrum for an accelerating voltage Fq is described, 
to a good approximation (outside the low-energy range) by 

/(v) = const • Z(Vmax- v) , (18.1) 




Fig. 18.2. X-ray bremsstrahlung. Spectral energy distribution of the 
x-rays emitted from a massive tungsten anticathode at various 
accelerating voltages for the bombarding electrons. The intensity is 
given in arbitrary units 




Fig. 18.3. Line spectrum of a Rh anticathode doped with Ru 
impurity. The lines are superimposed on the bremsstrahlung 
spectrum. The intensity is plotted against the grazing angle of the 
crystal spectrometer instead of the wavelength 
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Fig. 18.4. The intensity / in the x-ray bremsstrahlung spec- 
trum as a function of the frequency for different primary 
energies of the electrons used for excitation. The measure- 
ment was carried out with a thick anticathode (after 
Kulenkampf) 




Fig. 18.5. Origin of x-ray bremsstrahlung. An electron with 
energy Eq is deflected in the field of a nucleus and slowed 
down. In the process a quantum of light is emitted 



where I is the intensity of the radiation (energy per time and frequency interval and 
solid angle) and Z is the atomic number of the anticathode material. The limiting fre- 
quency Vmax is given by 

This means that the high-energy or short-wavelength limit of the x-ray spectrum 
Vmax is given by the energy equivalent eVo- The bremsstrahlung spectrum is a result of 
the fact that when electrons pass close to the atomic nuclei, they are deflected and 
slowed down (Fig. 18.5). A positive or negative accelerated charge will, according to 
classical electrodynamics, emit electromagnetic radiation. This is “white” or con- 
tinuous x-ray bremsstrahlung. In terms of quantum theory, this can be understood as 
follows: for each braking incident, a quantum of light hv = Eq- E is emitted. How- 
ever, since the beginning and end states are not quantised - the electrons are free, not 
bound - a “white” spectrum arises when there are many individual events. 

The reaction equation is 

Atom -I- e“(fast) ^ Atom + e“(slow) + hv . 

In the limiting case, the entire energy of the electron is emitted in a single quantum in 
the course of a single braking event. This x-ray quantum then has the energy 
^Vmax = The measurement of this short-wave limit is one method of determining 
Planck’s constant h with great precision. However, one must be careful about such 
precision measurements, because the work of escaping the solid, and the band structure 
of the solid, lead to uncertainties or corrections of a few electron volts in the energies 
at the short-wavelength limit of the bremsstrahlung spectrum. The bremsstrahlung 
spectrum thus gives very little information about atomic structure. 

The spatial distribution of the radiation can also be explained in terms of the 
classical view of bremsstrahlung. With a thin anticathode, in which multiple events are 
less probable, and with energies cKq which are not too high, the distribution is the same 
as with the classical Hertzian dipole. The maximum is perpendicular to the direction 
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Fig. 18.6. Spatial distribution of the bremsstrahlung 



from which the electrons are coming, and thus to the direction in which braking is oc- 
curring. The minimum of radiation intensity is in the direction of electron travel 
(Fig. 18.6). At higher accelerating voltages Vq a relativistic calculation is required and 
the radiation pattern folds “ahead”, i.e. in the beam direction. 



18.4 Emission Line Spectra: Characteristic Radiation 

Characteristic radiation consists of a relatively small number of lines. Figure 18.3 gives 
an example of this. The lines are again grouped into series, which converge to a short- 
wavelength limit, which is called an “edge”. With a rhodium anticathode, for example, 
one can observe the following lines and series by increasing the accelerating voltage on 
the electrons in steps: 

For accelerating voltages Vq > 0.5 kV, the lines of the M series. 

For accelerating voltages fo > 3.0 kV, the L series also. 

For accelerating voltages fo > 23 kV, the K series as well. 

The lines of the K series are doublets. 

In general it holds for characteristic spectra that while optical spectra contain a 
large number of lines which depend on the nuclear charge Z in a rather complicated 
way, and which are strongly influenced by chemical bonding, x-ray spectra include a 
limited number of lines which can be grouped into a few series. There is also a clear 
relationship to the nuclear charge (Fig. 18.7). Corresponding lines and edges are found 
at increasing quantum energies as the nuclear charge increases. The series are 
designated by the letters K, L, M, N, ... and the lines within the series by Greek lower 
case letters beginning with a. The fine structure splitting of the lines is indicated by 
numbers written as subscripts. 

To a good approximation, the first line of the K series, the line can be described 
for atoms with different nuclear charge numbers Z by the expression 

Vk„ = \f = R{Z- 1)^(1/!^- 1/2^) . (18.3) 

The first lines of the L series (L J are described by 

^i?(Z-7.4)^ = i?(Z-7.4)^(l/2^- 1/3^) . (18.4) 

A linear relationship between ]/v and the nuclear charge number Z for analogous 
X-ray lines or edges (see Sect. 18.6) in the spectra of different elements was discovered 
in 1913 by Moseley (Moseley lines; see Fig. 18.8). Comparison with the Balmer for- 
mula for hydrogen suggests that for the K line the nuclear charge is screened by one 
unit of charge, while for the L line, it is screened by almost eight units. 
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Fig. 18. 8. Mo seley diagram of the absorption edges. For the edge frequen- 
cies V, ^|V^R - {Z-s)/n, where 5 is the screening number (cf. (18.3), n and 
5 are different for different shells, and this is why the observed Moseley 
lines are not parallel. The lines for the spin doublets, e.g. Ljj diverge at 
the top because spin-orbit coupling increases with Z; cf. Sect. 18.5. 
[From K.H. Hellwege: EinfUhrung in die Physik der Atome, Heidelberger 
Taschenbiicher, Vol. 2, 4th ed. (Springer, Berlin, Heidelberg, New York 
1974) Fig. 72] 



i Fig. 18.7. Spectral positions of the characteristic x-ray 
emission lines and the absorption edges of the elements. 
The quantum energies increase with increasing nuclear 
charge number 




Chemical bonding of an atom has only a slight influence on its x-ray spectrum. 
However, exact measurement of this effect does provide important information about 
the behaviour of electrons in chemical bonds. This is of importance in molecular and 
solid-state physics. 

The emission of x-rays can be elicited not only by bombarding an anticathode with 
electrons, but also by irradiation of atoms, molecules or solids with x-rays. This is 
called x-ray fluorescence. 

The wavelength of the x-radiation is greater than, or at least equal to, that of the 
exciting light, but other than that, it is independent of the wavelength of the exciting 
radiation within certain limits. The lines of a series appear in a fluorescence spectrum, 
and then all of them at once, only when the quantum energy of the exciting radiation is 
at least as great as the quantum energy of the highest-energy, or shortest-wavelength 
line in the characteristic spectrum. It is the same with excitation of x-radiation by 
electron bombardment: the kinetic energy of the elctrons eVo must be at least as great 
as the quantum energy of the shortest-wavelength line of the series before this series 
appears in the emission spectrum. Thus emission of the line cannot be excited by the 
quantum energy of instead it is necessary to supply the energy of the K edge. This is 
the energy to which the lines of the K series converge, the series limit. From this and 
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Other observations, it was concluded that x-ray lines correspond to states of “inner” 
electrons which are bound in filled shells, in contrast to the more loosely bound outer 
electrons, which give rise to the optical spectra. 

In 1916, Kossel interpreted the generation of the x-ray line spectra as follows: first 
the exciting electron must remove an atomic electron from an inner shell. The resulting 
hole is filled by outer electrons, and their binding energy is released in the form of char- 
acteristic light quanta. All transitions which end on the same inner shell occur together, 
and form a series (Fig. 18.9). 




Fig. 18.9. Schematic explanation of the K, L and M 
series in x-ray spectra. Left: An electron hole is formed 
by ionisation of an inner shell {outward-pointing 
arrow). This is filled by an electron from a shell which 
is farther out. The binding energy is emitted as an x-ray 
quantum {inward-pointing arrow). Right: The same in 
the form of a term scheme. The ionisation limit is 
shaded in at the top. Unfortunately, in practice and by 
historical reasons, the use of the greek letters in the 
L, M, N . . . series is less systematic than indicated here, K 
cf. Fig. 18.9 
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The quantitative observations described by (18.3) and (18.4) thus become 
understandable: the atomic number Z is screened by one elementary charge in the K 
shell and by 7.4^0 in the L shell for the electron making the transition. 

The transitions involving inner shells are much more energetic than those in the 
outermost shell, because the nuclear charge is shielded only by those electrons in still 
lower shells. This results in screening to a charge (Z-1) for the lines, and to 
{Z-1 .4) for the lines. The field strength in the interior of a sphere with a uniformly 
charged surface is zero, so the external electrons make no contribution to the field ex- 
perienced by the inner ones. 



18.5 Fine Structure of the X-Ray Spectra 

The x-ray transitions indicated by Greek letters, K„, Kp, Lp, etc. thus start from 
terms with different principal quantum numbers n. To understand the “fine structure” 
of x-ray spectra, that is the occurrence of several components in a given transition, one 
must also take into account the orbital angular momentum and spin of the electrons. 
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For electrons in inner shells, orbital degeneracy (/ degeneracy) is naturally lifted. 
The reason for this, the different degrees of screening for electrons with different 
orbital angular momenta and the associated differences in the Coulomb potential, has 
already been discussed in the case of the spectra of alkali atoms (Sect. 11.2). Further- 
more, we must also take into account the actual fine structure due to spin-orbit 
coupling. The energy of this coupling increases rapidly with nuclear charge, as 
(Sect. 12.8). In heavy atoms such as uranium the spin-orbit splitting amounts to as 
much as 2 keV ! One can understand the structure of x-ray spectra if one realises that a 
missing electron, or a hole, in an otherwise full shell is equivalent to a single electron in 
an otherwise empty shell or also to an added positron. The angular momentum of the 
remainder of the atom clearly has the same magnitude, but the opposite sign, compared 
to the angular momentum of the electron which was removed. Naturally this 
equivalence goes only as far as the sign: to remove an electron from the atom we must 
apply energy. If we consider the binding energy of the electron to be negative, then we 
must consider the energy required to generate a hole to be positive. 

X-ray spectra can thus be understood, similarly to the spectra of alkali atoms, as 
one-electron (or one-hole) spectra. The terms may be characterised, as in the alkalis, 
by the quantum numbers of one electron; we thus arrive at a term diagram of the type 
shown in Fig. 18.10. 

In the K shell, n = 1, I can have only the value 0, j is equal to 1/2, and the state 
is denoted by the symbol ^Si/ 2 . 

For optical transition, the selection rules are Zl/ = ±1 and Ziy = 0, ± 1. The longest- 
wavelength lines of the K series, and Kf^ 2 ^ are thus produced in a manner analo- 
gous to the two sodium D lines; see Sect. 12.9. They connect the state n = \, with 
the states n = 2 ^Py2 and ^^ 3 / 2 , which are split by the spin-orbit interaction. 
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Fig. 18.10. Fine structure diagram for the x-ray spectra of a 
platinum anode with Z = 78. The notations for the series, 
the lines, and the quantum numbers are shown. Arrows 
pointing upwards mean absorption, those pointing down- 
wards mean emission. The use of Greek letters to denote the 
individual lines is not systematic and is not uniform in the 
literature. - The spacings between the L subshells Lj, 

Lju and the M subshells M^-M^ are not shown to scale. 
For a given /-value, they result from the normal doublet 
splitting; otherwise, they are produced by differing screen- 
ing of the nuclear charge and are therefore not all equal. 
The Greek-letter symbols for x-ray lines are of historical 
origin and are not identical to the systematic notation used 
in Fig. 18.9 
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Correspondingly, we can understand all of the fine structure of x-ray spectra. The 
shells which are characterised by the quantum numbers n are also split up into sub- 
shells. The latter are numbered using Roman numerals (e.g. Tj, Tn, Tni in Fig. 18.10). 
A subshell is characterised by a triplet of quantum numbers n, I, and j. The energy 
splitting between the edges Tj, Lj,, and Z.,n has, as indicated in Fig. 18.10, a variety 
of origins. The spacing between L,, and i.e. between and ^^* 3 / 2 , is the well- 
known doublet splitting, which increases with the fourth power of the screened nuclear 
charge Z (cf. Sects. 12.8 and 18.4). The spacing between £j and Ln, on the other 
hand, results from variations in screening. The latter is produced only by the inner elec- 
trons and is less dependent on Z. This is clarified in the Mosely diagram (Figs. 18.7 
and 8 ). 

If the atoms are not free, but instead are bound in a solid, then the upper occupied 
electronic energy levels are broadened into energy bands. X-ray lines resulting from 
transitions between the broad uppermost occupied energy band and an inner electronic 
level, which is not broadened by solid state effects, have a characteristic structure. 
Their width and intensity distribution can give a direct picture of the width of the up- 
permost energy band and its occupation by the electrons. 



18.6 Absorption Spectra 

X-rays, like any other electromagnetic radiation, are absorbed and scattered on passing 
through matter. The primary experimentally determined quantity is the extinction coef- 
ficient yu, defined by the equation 7 = 7oexp(-//x), where x is the thickness of the 
material irradiated, 7 q is the incident intensity, and 7 is the transmitted intensity. The 
result of a measurement is often given as the half-absorption thickness, d = \r\2. 

The half-absorption thickness depends, in general, on the material irradiated and on 
the energy of the x-ray quanta. Table 18.1 gives some numerical data. 



Table 18.1. Half-absorption thickness [cm] for x-rays in aluminium and lead 
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The extinction is the sum of scattering - which does not interest us here - and 
absorption. The dependence of the absorption coefficient on quantum energy, i.e. the 
spectral distribution in absorption spectra, is represented schematically in Fig. 18.11, 
where the emission spectra at various excitation energies are also shown for com- 
parison. 

X-ray absorption spectra typically display a large decrease in the absorption coef- 
ficient with increasing quantum energy, and absorption “edges”, which are quantum 
energies at which the absorption coefficient jumps to a higher value. These “edges” 
correspond to the series limits for the K,L,M,... series, and they are correspondingly 
labelled. The subshells also appear as edges, for example Lj, L^, andTjn in Fig. 18.11. 

The position of the K edge for lead at 88 keV (Fig. 18.12) means that the work of 
removing an electron from the K shell, where it experiences the field of the nearly 
unshielded nuclear charge of the lead nucleus, is 88 keV. The screening for lead 
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Fig. 18.11 a- e. Comparison of x-ray absorption 
and emission spectra, a) Absorption coefficient, e.g. 
of platinum for x-rays, as a function of frequency, 
schematically. The spectrum consists primarily of 
superimposed absorption edges and series-limit con- 
tinua. b-e) Emission spectrum of platinum at 
various excitation energies. All the line series are ex- 
cited in b, in c the K series is lacking, in d the K and 
L| series are absent, and in e the K, and Ljj series 
are missing. The spectral intensity distribution of the 
continuous (bremsstrahlung) spectrum is modified 
here, relative to the shape given by (18.1), as a result 
of frequency-dependent absorption. For platinum, 
the frequency value of the K edge is ca. 19 - 10^^ s~ ’ 
and that of the Lj edge is ca. 3- 10~ s“ ^ 



(Z = 82) can be calculated. For the innermost electron, the work of separation is 
Zeff 13.6, where Zgff = Z-s is the effective nuclear charge and 13.6 eV is the ionisa- 
tion energy for the hydrogen atom (Sect. 8.4). From (82 - 5 )^ -13.6 eV = 88 keV, 
5 = 1.61. 

In order for an atom to absorb x-radiation, an electron must be excited from an 
inner shell into a less strongly bound state. Since the neighbouring shells are already 
occupied, discrete absorption lines due to transitions from one shell into another are 
scarcely observable. There is, however, a continuum of free states on the other side of 
the series limit into which the absorbing electron can be lifted. Therefore, absorption is 
usually associated with ionisation, and the absorption spectra are the superimposed 
series-limit continua of the various shells and subshells. This is shown in Fig. 18.11. 

Going from lower to higher frequencies, absorption edges, or jumps in the absorption 
coefficient, are located at those points where the energy of the x-ray quantum is just suf- 
ficient to allow an absorptive transition from a new (lower) shell into the limiting con- 
tinuum. 

At lower frequencies, the quantum energy hv is only sufficient to release electrons 
from outer shells. Ashv increases, an energy is reached which is sufficient to release even 
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Fig. 18.12. Absorption cross section of lead for x-rays in the region 
of the L and K edges. The absorption coefficient is expressed in 
terms of the absorption cross section of an atom 



K electrons, and at this point the absorption coefficient increases abruptly. The fine 
structure of the absorption edges is further evidence for the existence of shells and 
subshells: there is one K edge, but 3L edges, 5 M edges, and so on. 

In addition, if the spectral resolution is good enough, it is possible to detect effects of 
chemical bonding on the energies and fine structures of the absorption edges. 

Aside from the edges, the frequency dependence of the absorption coefficient is 
essentially expressed by 

/iabs = ZV(A v)^ with 3 < X < 4 (18.5) 

/^abs = A Z . 

The v" ^-dependence of the absorption coefficient can be derived using the 
Hertzian model of a free electron as a forced oscillator. The hardness or penetrating 
ability of the x-rays thus increases as the wavelength decreases, or as the accelerating 
voltage increases. Figure 18.12 shows the frequency dependence of the atomic absorp- 
tion cross section (see Sect. 2.4.2) of lead in the region of the K and L edges. 



18.7 The Auger Effect 

Not all atoms from which electrons have been removed from inner shells by electron 
bombardment or other forms of energy transfer return to the ground state by emitting 
x-rays. Instead, the observed quantum yield for x-ray emission is frequently less than 1 . 
It is defined here as 

^ _ Number of x-ray emitting atoms 
Number of K, L.,. ionised atoms 

Thus it must be possible for the atoms to return to the ground state without emitting 
radiation. The probability of such non-radiative processes, which compete with x-ray 
emission, has been found to decrease with increasing nuclear charge. In light atoms, the 
non-radiative processes far outweigh emittive processes (Fig. 18.13). 

The non-radiative return to ground state is accomplished by the Auger effect. After 
an electron has been removed from an inner shell, the excess energy can be released 
either in the form of an x-ray quantum or through the emission of an electron from a 
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Fig. 18.13. The quantum yield for the emission of x-rays as a 
function of the Z number (schematically) 



X-ray emission 




Fig. 18.14. Auger electron emission 
{right) competes with x-ray emission 
{left) 



shell farther out. The Auger effect is thus similar to an “inner photoeffect”. If one elec- 
tron falls into a lower shell and another is simultaneously emitted, the Coulomb inter- 
action between the two must be responsible for the process. 

The Auger effect is represented schematically in Fig. 18.14. First the K shell is 
ionised. An L electron falls from the L to the K shell, and fills the hole there. The 
energy released by this is used to remove a second L electron from the L shell, and this 
one escapes from the atom. The result is that the L shell loses two electrons. These are 
then replaced by electrons from shells farther out, namely the M shell and so on. 

The kinetic energy of the Auger electron is given by 

£'kin= hvK^-Ei = Ek-El-El = Ex~2El , (18.6) 

where E^ and Ek are the binding energy of the electrons in the L and K shells, 
respectively. 

Auger electrons can also be observed directly in a cloud chamber. Their energy can 
be determined from the length of their tracks in the cloud chamber, or by means of a 
braking field method, and thus can be used to verify the explanation of the Auger 
effect. 

Finally, a numerical example: Silver is bombarded with Ka radiation from a 
tungsten anticathode (energy = 59.1 keV). Electrons with the following energies are 
observed: 

1) 55.8 keV Interpretation: Photoelectrons from the Ag L shell 

Because: the ionisation energy of the Ag L shell is 
EionL = 3.34 keV 

Therefore: 59.1 - 3.34 = 55.76 keV. 

2) 33.8 keV Interpretation: Photoelectrons from the Ag K shell 

Because: EioaK= 25.4 keV 
Therefore: 59.1-25.4 = 33.7 keV. 

3) 21.3 keV Interpretation: Auger electrons 

Because: Ex iAg) - Ei^nt = 24.9 - 3.34 

= 21.56 keV. 
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4) 18.6 keV Interpretation: Auger electrons 

Because: (Ag) - = 22.1 - 3.34 

= 18.76 keV. 



18.8 Photoelectron Spectroscopy (XPS), ESC A 

A relatively new method for investigating the energy states of the inner electrons of an 
atom is photoelectron spectroscopy. This technique is a modern application of the 
photoelectric effect (Sect. 5.3). 

Electrons are ejected from the shells of an atom by exciting them with light of a 
known quantum energy. The kinetic energies of the photoelectrons correspond to the 
difference between the quantum energy of the photons and the binding energies of the 
electrons in the atom, following the energy-balance equation of the photoelectric ef- 
fect, E'kin = ^^"^bind* The principle is illustrated in Fig. 18.15. Figure 18.16 shows as 
an example the binding energies in some light atoms. An example of a measurement 
for the Hg atom is given in Fig. 18.17. With this method, the binding energies of inner- 
shell electrons can be determined directly, in contrast to x-ray absorption spectroscopy, 
which gives only the energy of the absorption edge. 

The measurement of the kinetic energy of the photoelectrons is performed with 
high-resolution analysers, which allow a precise determination of the velocities of the 
electrons using the principle of the determination of e/m (deflection in electric and 
magnetic fields) described in Sect. 6.4. (Simultaneous measurement over a wide range 
of electron kinetic energies is also possible using time-of-flight analysers, with some- 
what poorer energy resolution.) The best energy resolutions currently available allow 
determination of the electron energies to about 20 meV. 

The light source provides either UV light of short wavelengths, e.g. the resonance 
lines from the Ne and the He spectra in the region between 15 and 50 eV, or else, for the 
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Fig. 18.15. Illustration of photoelectron spectroscopy. The kinetic energy 
of the photoelectrons is the difference between the quantum energy hv of 
the excitation photons and the binding energy of the electrons in an atom 
or a solid. The dashed lines represent the orbitals of the electrons. One 
should note, however, that here the binding energies, and not the dis- 
tance from the nucleus, are indicated 
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Fig. 18.16. Binding energy of the electrons in the H, Li, and F 
atoms; the atomic K, and shells are indicated. The 

zero point for the binding energy is the ionisation limit 
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Binding Energy 



Fig. 18.17. The photoelectron spectrum 
of Hg vapour, after K. Siegbahn (over- 
view of the spectrum). All the levels 
which can be excited with the radia- 
tion from A1 are to be seen. These are the 
N and O shells, as well as P\. Compare 
also the electron configuration of the 
Hg atom from Table 19.3b 




Fig. 18.18. The spectral distributions of different excitation light sources for photoelectron spectroscopy and 
of synchrotron radiation from different primary electron energies. The ordinate scales are in arbitrary units 
and are each different. After Bethge and Gruber 



investigation of states with higher binding energies, the characteristic x-ray lines, e.g. 
the lines from Cu (8048 eV), from A1 (1487 eV), or from Mg (1254 eV). Synchrotron 
radiation, which has a continuously variable photon energy in the whole UV and x-ray 
region (cf. Sect. 5.1), is a particularly suitable light source. An overview is given in 
Fig. 18.18. 

The binding energies of the electrons are characteristic of the particular atoms being 
investigated, so that the method can also be used for chemical analysis of a sample. 
Furthermore, the chemical bonding between atoms in molecules or in solids leads to a 
redistribution of the valence electrons. This, in turn, changes the entire bonding poten- 
tial in the atoms, whereby even the inner electrons are affected. The resulting small 
shifts in the inner-shell binding energies due to changes in the state of the outer (val- 
ence) electrons are termed chemical shifts and can be measured, e.g. between atoms in 
different ionisation states. The corresponding analytical method, called ESCA 
(Electron Spectroscopy for ChemicaMnalysis), was developed in particular hy K. Sieg- 
bahn and coworkers. It has become an important experimental technique in chemistry 
and in molecular and solid state physics. 
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Problems 

18.1 What is the shortest possible wavelength for bremsstrahlung observed when an 
electron which has been accelerated through a potential difference of 40 kV is stopped 
by the anticathode of an x-ray tube? In what region of the electromagnetic spectrum 
does this wavelength lie? 

18.2 The Kd line of cobalt is at 1 .785 A. What is the energy difference between the 
and 2p orbitals in cobalt? Compare this result with the energy difference between the 
is and 2p orbitals in hydrogen (that is, the first Lyman line). Why is the difference 
much greater for cobalt than for hydrogen? 

18.3 The most intense line in the x-ray spectrum arises from the transition in which an 
electron goes from the shell with n = 2 to the shell with « = 1 . The transition is de- 
scribed by Moseleys rule (cf. Sect. 18.4). What is the wavelength of this line for copper? 

18.4 The maximum energy of the characteristic x-rays emitted by a sample of un- 
known composition corresponds to the wavelength 2.16 A. Of what element does the 
sample consist? 

18.5 X-rays are allowed to pass through aluminium foils, each 4 x 10~^ m thick. 
A Geiger counter registers 8 x 10^ 4.7 x 10^ 2.8 x 10^ 1.65 x 10^ and 9.7 x 10^ 
events/min when the rays have passed through 0, 1, 2, 3 and 4 foils, respectively. 
Calculate the linear absorption coefficient of aluminium. 

18.6 Gamma rays with energies of 0.05, 0.3 and 1 MeV, but the same intensities, fall 
onto a lead absorber. The linear absorption coefficients for these energies are 8 x 10^, 
5x10^ and 78 m“^ 

a) Calculate the thickness of lead required to reduce the intensity of each beam of 
gamma rays to one tenth of its original value. 

b) What is the relation of the total intensity (at each photon energy) at a depth of 5 mm 
to the total incident intensity? 

18.7 How many times the half-absorption thickness of a material is required to 
reduce the intensity of an x-ray beam to (a) 1/16, (b) 1/20, or (c) 1/200 of the incident 
intensity? 

18.8 a) The K absorption edge of tungsten is at 0.178 A, and the wavelengths of the 
lines of the K series are (ignoring the fine structure) = 0.210 A, Kp =0.184 A, and 
Ky = 0.179 A. Sketch the energy levels of W and give the energies of the K, L, Mand N 
shells. 

b) What is the minimum energy required to excite the L series in tungsten? What is 
the wavelength of the L line? 

18.9 The Li absorption edge in tungsten is at 1.02 A. Assume that a K„ photon is 
“absorbed” by one of the 2s electrons by an Auger process. Determine the velocity of 
the photoelectron released. 
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Ground States of the Elements 



19.1 Periodic System and Shell Structure 

It is one of the goals of atomic physics to understand the ordering and the properties of 
the chemical elements in the periodic system. The empirically determined physical and 
chemical properties of the atoms and their dependence on the atomic number Z ought 
to be explainable starting from the electronic structures of the atoms. After having, in 
the previous chapters, discussed one-electron spectra, many-electron spectra, and the 
spectra of the inner shells in detail, we are now in a position to understand, at least in 
principle, the spectra of any atom in any state of excitation. We will outline and extend 
this understanding in the present chapter. 

In particular, in the discussions of alkali atoms and x-ray spectra, we have met with 
some important experimental facts which led to the concept of the shell structure of 
atoms. It is known that the noble gases are chemically particularly stable. Furthermore, 




Fig. 19.1. Electronic shells of the ions of 
atoms, following Kossel, 1916. The number 
of electrons is plotted against the nuclear 
charge number for atoms and for the ions 
which are preferentially formed from those 
atoms in chemical bonding. The electronic 
shells of the atoms are depleted or filled up 
to the electron numbers indicated by arrows 
in the diagram when ions are formed. With 
the exceptions of nickel and palladium, the 
preferred electron numbers are those of the 
noble gases. [From K. H. Hellwege: Einfiih- 
rung in die Physik der A tome, Heidelberger 
Taschenbucher, Vol. 2, 4th ed. (Springer, 
Berlin, Heidelberg, New York 1974) Fig. 61] 
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if the electrochemical valency of an ion is compared to the distance of the correspond- 
ing neutral element from the nearest noble gas in the periodic table, it is found that the 
atoms have been ionised to such an extent that the ions have assumed the electron 
number of a neutral noble gas. The observed ionisation levels of the atoms are shown in 
a way that makes this clear in Fig. 19.1. Thus when an ion is formed, those electrons 
which are in excess of the electron number of a noble gas are most easily lost, or the 
electrons which are lacking from the electron number of the next heavier noble gas are 
most easily acquired. Consider, for example, the elements around the noble gas neon 
(Z = 10). The next element in the periodic table is sodium (Z = 11), which preferentially 
forms singly charged positive ions. The next element, magnesium (Z = 12), forms 
doubly charged positive ions. The next lighter element than neon, fluorine (Z = 9), 
forms singly charged negative ions (by acquisition of an electron) and so on. 

It can be seen from Fig. 19.1 that nickel and palladium also have preferred electron 
numbers, because the atoms of neighbouring elements tend to assume these numbers 
when they become ionised. The stable electron configurations are thus not limited to 
the noble gases. This is made clear in the discussion of Table 19.3. 

We know from the spectroscopic studies of atoms discussed in earlier chapters that 
the unusually stable electron configurations are characterised by complete mutual com- 
pensation of all angular momenta and magnetic moments. Furthermore, the shell and 
subshell structure of the electron distribution of atoms appears especially clearly in x- 
ray spectra as a system of absorption edges. 

We now turn to the question of which electronic configurations are possible in 
atoms, which are particularly stable, and how the electrons of an atom are distributed 
among the possible quantum states. 

To understand this, one first needs the quantum numbers with which each atomic 
electron can be characterised: 

the principal quantum number n, 

the orbital angular momentum quantum number / = 0, 1 . . . « - 1 , 

the magnetic quantum number m/ = 0, ± 1 . . . ± /, 

the magnetic spin quantum number 171^= ± 1/2. 

One also needs the Pauli principle. In non-mathematical terms, this says that only 
those atomic states can exist in nature in which any two electrons differ, in at least one 
of their quantum numbers. This results in limitations in the possible combinations of 
quantum numbers in an atomic state. 

Strictly speaking, in order to define the quantum numbers of an electron in the 
atom, one would have to solve the Schrodinger equation for a many-particle problem, 
namely for all the electrons in the atom. The problem is solved approximately using the 
Hartree-Fock technique, which is based on the model of independent particles (Sect. 
19.4). The basic idea is that instead of trying to calculate the interactions of N - 1 elec- 
trons with the Mh electron, one replaces the Coulomb attraction of the nucleus for the 
Mh electron by an effective potential. One then calculates the eigenstates and eigen- 
values of the A^th electron in this potential field. These one-particle functions are often 
referred to in the literature as orbitals. 

Figure 19.2 shows, as the result of such calculations, the shell structure of the 
hydrogen atom and the positive ions of lithium, sodium and potassium. As the figure 
shows, the shells overlap, and thus have no unique geometric significance. As the 
nuclear charge increases, they are drawn closer to the nucleus. Another example of a 
calculated electron density distribution was discussed in Fig. 11.8. 
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Fig. 19.2. Radial density distribution of electrons in the 
hydrogen atom, and the singly charged positive ions of 
lithium, sodium and potassium. It is especially noticeable 
how the K shell electrons move closer to the nucleus as the 
nuclear charge increases 



In the absence of an external magnetic field, the magnetic quantum numbers would 
seem to lose their function, because of degeneracy. In order to apply the Pauli principle 
to atomic terms in such cases, we make use of of Ehrenfesfs adiabatic invariance 
principle, which states that if a parameter changes continuously, the states of an atom 
also change in a continuous and uniquely determined way. In the present case, this 
means that the states of the atom in the absence of a magnetic field must derive from 
those in the presence of a strong field - where the L*S-coupling and the m-degeneracy 
are lifted - in a continuous and uniquely determined way as the field is slowly turned 
off. 

With the Pauli principle it is easy to count the maximum number of electrons with 
a given principal quantum number n which can be bound to an atom: 

- For a given value of the principal quantum number n there are n different values for 
the orbital angular momentum quantum number /. 

- For every value of / there are 2/+1 different values of the magnetic quantum 
number m/. 

- For each pair of quantum numbers / and m/ there are two different values of the 
spin quantum number m^. 

- Thus for each pair of numbers n, I there are at most 2(2/+ 1) electrons. 

The maximum number of electrons in a shell with a given value of n is then 

n-\ 

I2(2/+l) = 2n^ (19.1) 

1=0 

Table 19.1 gives the possible combinations of quantum numbers for « = 1, 2 and 3. 

If a shell is defined as the group of all electrons with the same principal quantum 
number n, then the atoms with closed shells are helium (« = 1), neon (« = 2), nickel 
(n = 3, Z = 28) and neodymium (n = 4, Z = 60). Nickel and neodymium are neither 




326 



19. Structure of the Periodic System. Ground States of the Elements 



Table 19.1. Possible quantum numbers and numbers of electrons in the shells with « = 1, 2, 3 
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noble gases nor chemically very inactive. The simple equation of a closed shell with a 
noble gas configuration breaks down for higher electron numbers. In the He atom, the 
K shell is fully occupied; in Ne, the K and L shells; and in Ar, also the 5-- and /7-subor- 
bitals of the M shell. 

Instead, it is observed that the closure of a partial shell, that is occupation of all 
states with the same value of / for a given value of n, leads to especially stable electron 
configurations. This is understandable in light of the fact that even for fully occupied 
partial shells, the angular momentum and magnetic moments add up to zero, so that 
the atom is outwardly spherically symmetric. In fact, this is the case with the third 
noble gas, argon, which has the electronic configuration Xs^ls^lp^'is^'ip^, In other 
words, all of the s and p electrons are present in the third shell, but there are no d 
electrons. The cases of the noble gases krypton and xenon are similar, as is shown in 
Table 19.2. 



Table 19.2. Electronic configuration for the highest occupied shells or partial shells of the noble gases. Only 
helium and neon have completely filled highest shells. The particular stability of the other noble gases is due 
to completion of partial shells 



Highest 
occupied state 


z 


Element 


1st ionisation 
potential [eV] 


(15)^ 


2 


He 


24.58 


(Isfdpf 


10 


Ne 


21.56 


OsfOpf 


18 


Ar 


15.76 


(Asf(^df^(Apf 


36 


Kr 


14.00 


(Ssf(^d)^^(5pf 


54 


Xe 


12.13 


(6s)\Af)^\5d)'\6pf 


86 


Rn 


10.75 



Having seen that each partial or complete shell can only accommodate a certain 
maximum number of electrons, we can turn to the configurations of all the atoms of 
the periodic system. Table 19.3a and b shows all the elements with their electronic con- 
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figurations. Closed complete and partial shells are shaded in. The table also shows the 
first ionisation energy for each element. 

With increasing atomic number Z, the added electrons successively occupy the un- 
filled shells and subshells. The order of this filling depends on the binding energies; 
in particular, for heavy atoms it does not correspond simply to the numerical order. 
The first deviation is found in the K atom: there, it is energetically more favorable for 
the last electron to occupy the 45 subshell rather than the 'id. Compare also Sect. 11.3. 

The interaction energies of the electrons with the nucleus and with each other, 
which play a key role in determining the energetic positions of the subshells and thus 
the structure of the Periodic Table, are dominated for lighter elements by purely elec- 
trostatic interactions, particularly screening of the nuclear charge by the other elec- 
trons. With increasing atomic number Z, the magnetic interactions, specifically the 
spin-orbit interaction, become comparably large and, for the heaviest elements, finally 
determine the ordering of the electronic states. 

Figure 19.3 again summarises the energetic ordering of the subshells for the suc- 
cessive addition of electrons, corresponding to the shell structure. It is characterised 
by the fact that at certain energies, large gaps between the energy levels are found. 
These mark the especially stable configurations, for example those of the noble gases. 

The shell structure that we have described here, which is important for understan- 
ding the Periodic System, is - as we again emphasize - that of the last electron added 
to the atom. When the initially outermost electrons move inwards, as the atomic num- 
ber increases and further electrons are added to the outer shells, they become more and 
more subject to the effective potential of the other electrons and the ordering of their 
energy levels changes. This is illustrated in Fig. 19.3. The energetic order of the sub- 
shells then becomes a function of the principal quantum number n only. This shell 
structure, determined only by «, is the one we have already met with in treating the 
x-ray spectra of atoms. It can also be investigated using photoelectron spectroscopy. 




added 



Fig. 19.3. The shell structure of the atomic energy levels and their 
ordering, for the last added electron and for the inner electrons. 
Especially stable configurations are those in which a large energy 
gap to the next level occurs 




Table 19.3a. Periodic Table with electron configurations, ground state terms, and ionisation energies. The 
filled shells and subshells are shaded 



Atomic Element 


Shells 






LS 

configuration 


First 

ionisation 


number 

Z 


K L M 

n = 2 n = 3 

s s p s p d 


N 

n = 4 
s p d 


0 

n = 5 
s p 


of the ground 
state 


potential 

[eV] 



1 


Hydrogen 


H 


1 






















13.60 


2 


Helium 


He 


2 




















'5. 


24.58 


3 


Lithium 


Li 


2 


1 




















5.39 


4 


Beryllium 


Be 


2 


2 




















9.32 


5 


Boron 


B 


2 


2 


1 


















8.30 


6 


Carbon. 


C 


2 


2 


2 


















11.26 


7 


Nitrogen 


N 


2 


2 


3 


















14.54 


« 


Oxygen 


O 


2 


2 


4 


















13.61 


9 


Fluorine 


F 


2 


2 


5 


















17.42 


10 


Neon 


Nc 


2 


2 


6 


















21.56 


] 1 


Sodium 


Na 


2 


2 


6 


1 
















5.14 


12 


Magnesium 


Mg 


2 


2 


6 


2 
















7.64 


13 


Aluminium 


A1 


2 


2 


6 


2 


1 














5.98 


14 


Silicon 


Si 


2 


2 


6 


2 


2 














8,15 


15 


Phosphorous 


P 


2 


2 


6 


2 


3 














10.55 


16 


Sulphur 


S 


2 


2 


6 


2 


4 














10.36 


17 


Chlorine 


Cl 


2 


2 


6 


2 


5 














13.01 


IH 


Argon 


Ar 


2 


2 


6 


2 


6 












'So 


15,76 


\9 


Potassium 


K 


2 


2 


6 


2 


6 


1 












4.34 


20 


Calcium 


Ca 


2 


2 


6 


2 


6 


2 












6.11 


21 


Scandium 


Sc 


2 


2 


6 


2 


6 1 


2 












6.56 


22 


Titanium 


Ti 


2 


2 


6 


2 


6 2 


2 












6.83 


23 


Vanadium 


V 


2 


2 


6 


2 


6 3 


2 












6.74 


24 


Chromium 


Cr 


2 


2 


6 


2 


6 5 


i 










’S, 


6.76 


25 


Manganese 


Mn 


2 


2 


6 


2 


6 5 


2 










‘S,., 


7.43 


26 


iron 


Fc 


2 


2 


6 


2 


6 6 


2 












7.90 


27 


Cobalt 


Co 


2 


2 


6 


2 


6 7 


2 












7.86 


2S 


Nickd 


Ni 


2 


2 


6 


2 


6 8 


2 












7.63 


29 


Copper 


Cu 


2 


2 


6 


2 


6 10 


1 












7.72 


30 


Zinc 


Zn 


2 


2 


6 


2 


6 10 


2 












9.39 


31 


Gallium 


Ga 


2 


2 


6 


2 


6 10 


2 


1 










6.00 


32 


Germanium 


Ge 


2 


2 


6 


2 


6 10 


2 


2 










7.88 


33 


Arsenic 


As 


2 


2 


6 


2 


6 10 


2 


3 










9.81 


34 


Selenium 


Se 


2 


2 


6 


2 


6 ID 


2 


4 










9.75 


35 


Bromine 


Br 


2 


2 


6 


2 


6 10 


2 


5 










n.84 


36 


Krypton 


Kr 


2 


2 


6 


2 


6 10 


2 


6 










14,00 


37 


Rubidium 


Rh 


2 


2 


6 


2 


6 10 


2 


6 




1 




'Si, 2 


4,18 


38 


Strontium 


Sr 


2 


2 


6 


2 


6 10 


2 


6 




2 




’So 


5.69 


39 


Yttrium 


Y 


2 


2 


6 


2 


6 10 


2 


6 


1 


2 






6.38 


40 


Zirconium 


Zr 


2 


2 


6 


2 


6 10 


2 


6 


2 


2 






6.84 


41 


Niobium 


Nb 


2 


2 


6 


2 


6 10 


2 


6 


4 


1 






6.88 


42 


Molybdenum 


Mo 


2 


2 


6 ' 


2 


6 10 


2 


6 


5 


1 






7.13 


43 


Technetium 


Tc 


2 


2 


6 


2 


6 10 


2 


6 


6 


1 






7.23 


44 


Ruthenium 


Ru 


2 


2 


6 


2 


6 10 


2 


6 


7 


1 






7J7 


45 


Rhodium 


Rh 


2 


2 


6 


2 


6 10 


2 


6 


8 


1 






7.46 


46 


Palladium 


Pd 


2 


2 


6 


2 


6 10 


2 


6 


10 






’So 


8.33 


47 


Silver 


Ag 


2 


2 


6 


2 


6 10 


2 


6 


10 


1 






7.57 


48 


Cadmium 


Cd 


2 


2 


6 


2 


6 10 


2 


6 


10 


2 




■So 


8.99 


49 


Indium 


In 


2 


2 


6 


2 


6 10 


2 


6 


10 


2 


1 




5.79 


50 


Tin 


Sn 


2 


2 


6 


2 


6 ID 


2 


6 


10 


2 


2 




7J3 


51 


Antimony 


Sb 


2 


2 


6 


2 


6 10 


2 


6 


JO 


2 


3 




8.64 


52 


Tellurium 


Tc 


2 


2 


6 


2 


6 10 


2 


6 


10 


2 


4 




9.01 


53 


Iodine 


j 


2 


2 


6 


2 


6 10 


2 


6 


10 


2 


5 




10.44 


54 


Xenon 


Xe 


2 


2 


6 


2 


6 JO 


2 


6 


10 


2 


6 


^So 


12.13 



Transition elemenu i I Transition elements 




Table 19.3b. Periodic Table with electron configurations, ground state terms, and ionisation energies. The 
filled shells and subshells are shaded. (The subshells 5g and 6/, 6g, 6 h are not shown, since there are no 
atoms which have electrons in these shells in their ground states). The transuranium elements with atomic 
numbers from 106 to 118, which have recently been artificially produced, are not shown here 



Atomic 


Element 




Shells 




















L5 


First 




number 






-- 


— 


— 




— 


— 


— 




— 






— configuration 


ionisation 




2 






W 








O 








P 




Q 


of the ground 


potential 










n- 


= 4 






n = 


= 5 






n- 


-6 


n = 


7 Slate 


[fiV] 










s 


P 


d 


/ 


s 


P 


d 


/ 


s 


P 


s 








55 


Cesium 


Cs 


2 


6 


10 




2 


6 






I 








3.89 




56 


Barium 


Ba 


2 


6 


10 




2 


6 






2 








5.21 




57 


Lanthanum 


La 


1 


6 


10 




2 


6 


1 




2 








5.61 




58 


Cerium 


Ce 


2 


6 


JO 


2 


2 


6 






2 








5.6 




59 


Praseodymium 


Pr 


2 


6 


10 


3 


2 


6 






2 






"ha 


5.46 




60 


Neodymium 


Nd 


2 


6 


10 


4 


2 


6 






2 








5.51 




61 


Promethium 


Pm 


2 


6 


10 


5 


2 


6 






2 












52 


Samarium 


Sm 


2 


6 


10 


6 


2 


6 






2 








5,6 




63 


Europium 


Eu 


2 


6 


10 


7 


2 


6 






2 








5,67 




64 


Gadolinium 


Gd 


2 


6 


10 


7 


2 


6 


1 




2 








6.16 




65 


Terbium 


Tb 


2 


6 


10 


9 


2 


6 






2 






— 


5.98 




66 


Dysprosium 


Dy 


2 


6 


10 


10 


2 


6 






2 








6,8 


Cd 


67 


Holmium 


Ho 


2 


6 


10 


11 


2 


6 






2 












68 


Erbium 


Er 


2 


6 


10 


12 


2 


6 






2 








6.08 




69 


Thulium 


Tm 


2 


6 


10 


13 


2 


6 






2 








5.81 




70 


Ytterbium 


Yb 


2 


6 


10 


14 


2 


6 






2 








■26 




71 


Lutetium 


Lu 


2 


6 


10 


14 


2 


6 


1 




2 








6.15 




72 


Hafnium 


Hf 


2 


6 


10 


14 


2 


6 


2 




2 








5.5 




73 


Tantalum 


Ta 


2 


6 


10 


14 


2 


6 


3 




2 








7.7 


4/5 


74 


Tungsten 


W 


2 


6 


10 


14 


2 


6 


4 




2 








7.98 


e 


75 


Rhenium 


Re 


2 


6 


10 


14 


2 


6 


5 




2 








7.87 


fll 


76 


Osmium 


Os 


2 


6 


10 


14 


1 


6 


6 




2 








8.7 


sn 


77 


Iridium 


Ir 


2 


6 


10 


14 


2 


6 


9 












9.2 




78 


Platinum 


Pt 


2 


6 


10 


14 


2 


6 


9 




1 








9.0 


s 


79 


Gold 


Au 


2 


6 


10 


14 


2 


6 


10 




1 








9.22 


H 


80 


Mercury 


Hg 


2 


6 


10 


14 


2 


6 


10 




2 








10.43 




81 


Thallium 


Tl 


2 


6 


10 


14 


2 


6 


10 




2 


1 






6.11 




82 


Lead 


Pb 


2 


6 


10 


14 


2 


6 


10 




2 


2 






7,42 




S3 


Bismuth 


Bi 


2 


6 


10 


14 


2 


6 


10 




2 


3 






7.29 




84 


Polonium 


Po 


2 


6 


10 


14 


2 


6 


to 




2 


4 






8.43 




B5 


Astatine 


At 


2 


6 


10 


14 


2 


6 


10 




2 


5 






9.5 




S6 


Radon 


Rn 


2 


6 


10 


14 


2 


6 


10 




2 


6 






10.75 




87 


Francium 


Fr 


2 


6 


10 


14 


2 


6 


10 




2 


6 


1 




4 




88 


Radium 


Ra 


2 


6 


10 


14 


2 


6 


10 




2 


6 


2 




5.28 




89 


Actinium 


Ac 


2 


6 


10 


14 


2 


6 


10 




2 


6 1 


2 








90 


Thorium 


Th 


2 


6 


10 


14 


2 


6 


10 




2 


6 2 


2 








91 


Protactinium 


Pa 


2 


6 


10 


14 


2 


6 


10 


2 


2 


6 1 


2 








92 


Uranium 


U 


2 


6 


10 


14 


2 


6 


10 


3 


2 


6 1 


2 








93 


Neptunium 


Np 


2 


6 


10 


14 


2 


6 


10 


4 


2 


6 1 


2 








94 


Plutonium 


Pu 


2 


6 


10 


14 


2 


6 


10 


6 


2 


6 


2 








95 


Americium 


Am 


2 


6 


10 


14 


2 


6 


10 


7 


2 


6 


2 








96 


Curium 


Cm 


2 


6 


10 


14 


2 


6 


10 


7 


2 


6 1 


2 








97 


Berkelium 


Bk 


2 


6 


10 


14 


2 


6 


10 


8 


2 


6 1 


2 








98 


Californium 


cr 


2 


6 


10 


14 


2 


6 


10 


10 


2 


6 


2 






< 


99 


Einsteinium 


Es 


2 


6 


JO 


14 


2 


6 


10 


1 1 


2 


6 


2 








100 


Fermium 


Fm 


2 


6 


10 


14 


2 


6 


10 


12 


2 


6 


2 








101 


Mendelevium 


Md 


2 


6 


10 


14 


2 


6 


10 


13 


2 


6 


2 








102 


Nobelium 


No 


2 


6 


10 


14 


2 


6 


10 


14 


2 


6 


2 








103 


Lawrencium 


Lr 


2 


6 


10 


14 


2 


6 


10 


14 


2 


6 1 


2 








104 


Ruthcj fond i uni 


Rf 


2 


6 


to 


14 


2 


6 


10 


14 


2 


6 2 


2 








105 


Dubnium 


Db 


2 


6 


10 


14 


2 


6 


10 


14 


2 


6 3 


2 
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As discussed in Sect. 18.8, this is a technique for studying quantitatively the binding 
energies of the inner (and outer) electrons of atoms. 

The occupation of the shells and subshells is responsible for the physical and 
chemical properties of the atoms. The alkali atoms Li, Na, K, Rb, Cs, and Fr have a 
single electron in a newly-begun shell, i.e. in the L shell, the M shell, the 45 subshell 
of the shell, etc. They are therefore easily ionised - they have low ionisation poten- 
tials - and are chemically monovalent. The halogen atoms F, Cl, Br, and I are also 
monovalent: here, just one electron is lacking to fill the outermost p subshell in the L, 
M, N, and O shells, respectively. As a result, these elements have a large electron affini- 
ty; this means that the “electronic hole” can easily be filled as a result of a chemical 
reaction. 

The changed energetic ordering of the subshells is especially noticeable in the case 
of the transition elements and the rare earths. The elements of the first transition series 
from Sc to Ni owe their characteristic properties, such as their colours and paramagne- 
tism, to their partially-filled 3d shells. Their chemical valence is determined by the 
outer (valence) electrons. Since these are close neighbours to the 3 electrons, the latter 
can readily switch between shells, giving the transition elements a variety of different 
possible valencies. 

A similar picture is found for the second transition series or 4 d transition elements, 
39Y through 46Pd, as well as for the 4/ transition elements, the lanthanoids or rare 
earths 57La through 70 Yb, where the 4/ subshell is being filled while the 65 subshell is 
already full; for the 5d elements 71LU through 7gPt; and for the 5/ elements (ac- 
tinoids) 9iPa through losLr. 

The rare earths are the most impressive example of the special properties of atoms 
in which energetically higher but spatially more contracted shells are being filled. 
Chemically, they are all very similar, since their valence configurations in the 65 (and 
5d) subshells are identical or quite similar. Their colours and paramagnetism result 
from their inner 4/ electrons. The screening of the 4/ shell by the outer 6s electrons 
is also the reason that the optical spectra of rare earths exhibit sharp lines, even when 
they are in the solid state. Rare earth atoms or ions in solids are therefore especially 
well suited as laser media (see Chap. 21). Neodymium has been mainly used for this 
purpose up to now. 

Palladium, with Z = 46, is also quite interesting: it has a fully closed subshell con- 
figuration. The fact that it is nevertheless not a noble gas becomes understandable 
when one considers the previous element, rhodium. The 55 electron which is still pre- 
sent in Rh has switched to the 4d shell in Pd; it requires only a small amount of energy 
to raise it again to the 5s shell. For this reason, Pd is not chemically inactive and not 
a noble gas, although it is considered to be a noble metal. 



19.2 From the Electron Configuration to the Atomic Term Scheme. 
Atomic Ground States 

We are now familiar with the electronic configurations of the atoms, as far as the quan- 
tum numbers n and / are concerned. It remains to discuss the energetic order of the 
states with different values of m/ and rris and the combination of the angular momenta 
of individual electrons to form the total angular momentum of the atom. 

Figure 19.4 shows the ground states and the electronic configurations of the first 11 
elements. For the beryllium and carbon atoms the lowest excited state is also given. 




19.2 Ground States of Atoms 
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Hydrogen and helium, with the ground states ^Su 2 and 'Sq have already been discussed 
at length. In lithium, the second shell is started, with a 2s electron. Beryllium, with a 
closed 2s subshell, has a ^Sq ground state. It can easily be excited to the 2p configura- 
tion, which is energetically very close to the ground state. The occupation of the 2p 
subshell begins with boron; from its spectrum it is clear that this element has a ^Pm 
ground state, i.e. that its orbital and spin angular momenta point in opposite direc- 
tions. The spectrum of carbon indicates that the spins of the two 2p electrons are paral- 
lel, so the ground state is ^Pq. The excited state C* shown in Fig. 19.4, in which there 
are one 2s and three 2p electrons, is responsible for the valence of 4 which carbon dis- 
plays in organic chemistry. The four electrons in the second shell are coupled in this 
state so that they are energetically equivalent. This leads to what is called hybridisa- 
tion (cf. Sect. 23.7) which effectively determines the character of the chemical bonding. 

It turns out that in nitrogen, the three p electrons have parallel spins, so that they 
form a "^ 53/2 ground state. Spin saturation of the p electrons then begins with oxygen. 
Fluorine lacks only one electron in the p subshell. With neon, the noble gas ground 
state ^So is reached again. The occupation of the M shell with a 2 = 3 begins with the 
alkali atom sodium. 

There are several rules for the energetic ordering of the electrons within the sub- 
shells, which hold in addition to the Pauli principle. In LS coupling, i.e. in all light 
atoms, the angular momenta in the ground state are governed by Hund's Rules. They 
are: 

1) Full shells and subshells contribute nothing to the total angular momenta L 
and 5. We have already shown this earlier. 

2) The electrons having the same value of / which are divided among the corre- 
sponding m/ subshells - called equivalent electrons - are placed into the ground state 





3s 

2p 

2s 

Is 



11 



^ 1/2 



IT 



Na 



H- 



Fig. 19.4. Electronic configurations and arrangement of the 
electron spins in the ground states of the atoms H to Na. The lowest 
excited state is also given for beryllium and carbon 
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in such a way that the resulting total spin S is maximised. States with the highest multi- 
plicities thus lie energetically lowest, e.g. triplet states are lower than singlet states. This 
is a consequence of the Pauli principle, which requires that the total wavefunction be 
antisymmetric (see Sects. 19.4, where this “antisymmetry” will be discussed in more 
detail). The higher the multiplicity, the more parallel spins there are, which are thus 
completely symmetrical in their symmetry properties. Therefore their spatial functions 
must be antisymmetric. As a result, their binding energy is maximised, because the 
mutual Coulomb repulsion of the electrons is lowest for antisymmetric spatial 
functions. 

Let us consider the nitrogen ground state as an example. Nitrogen has three elec- 
trons in the outermost subshell. It therefore has a doublet and a quartet system, i.e. 
S = 1/2 and S = 3/2 (Fig. 17.4). Given the possible combinations of the quantum 
numbers m/for the three p electrons of the configuration Xs^ls^lp^, the overall state 
can be either or "^5. Of these, the one with the lowest energy is which is the 

state with the highest multiplicity. The other two states actually occur at somewhat 
higher energy in the doublet part of the nitrogen term scheme (Fig. 17.4). 

3) When the highest value of the quantum number S is reached, the Pauli principle 
requires that the electrons be distributed among the substates m/in such a way that 
Lz= D = rriih is maximised. The resulting angular momentum quantum number 
L is then equal to For a given multiplicity S, the higher values of L give states of 
lower energy. 

4) Finally, when the spin-orbit coupling is also taken into account, the terms with the 
smallest quantum numbers J have the lowest energy in “normal” multiplets, but other- 
wise the converse holds. “Normal” means here that the subshells are less than half full. 

This rule is a consequence of the fine structure calculation which was carried out 
earlier for the one-electron atom (Sect. 12.8). As the negative charge circulates around 
the positive nucleus, the orbital magnetic field at the position of the electron is directed 
in such a way that an antiparallel orientation of L and S corresponds to a minimum 
energy. If the shell is more than half full, however, each electron which would be 
needed to fill the shell is equivalent to a positively charged “hole”. The sign of the 
magnetic field then changes, and the state with the highest J has the lowest energy. 
In Fig. 19.4 this is shown by comparison of the one-electron atom lithium (ground state 
^Si/ 2 ) and the “one-hole” atom fluorine (ground state ^^ 3 / 2 ). 

The carbon atom can serve as an example here. Its term scheme is shown in 
Fig. 17.5. In the ground state, the carbon atom has two electrons in the outermost 
subshell. It therefore has a singlet and a triplet system with S = 0 and S = 1. The 
possible ground states for the electronic configuration is^2s^2p^ are ^S, and ^P, 
Because it has the largest multiplicity, has the lowest energy, as can also be seen in 
Fig. 17.5. If the angular momentum quantum number /is included, the possibilities are 
^Pq, ^Px and ^Pi, i.e. /= 0, 1 and 2. In a “normal” order of terms, the state ^Pohas the 
lowest energy, by rule 4. The singlet terms are also observed, at somewhat higher 
energy, as the term scheme in Fig. 17.5 and Fig. 19.5 shows. 

In the oxygen atom, however, the possible ground states for the l5^25'^2/7'^ con- 
figuration are ^5, and ^P. According to rule 2, ^P has the lowest energy. Now, 
however, with an inverted term order, the state with the largest value of J (rule 4) has 
the very lowest energy. The ground state is therefore ^^ 2 * 

Table 19.3 contains the LS-ground state terms which are actually observed for all 
atoms. When the atomic number Z becomes large, they can often not be simply derived 
from the electron configurations, as they can near the beginning of the Periodic Table. 
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19.3 Excited States of Atoms and Possible Electronic Configurations. 
Complete Term Schemes 

Several complete term schemes for atoms have already been shown in the figures. Each 
of the energy terms indicated there corresponds to a particular electron configuration 
and to a certain type of coupling of the electrons in non-filled shells. The energetic posi- 
tions of these terms are uniquely determined by the energies of interaction between the 
nucleus and electrons and between the electrons, themselves. Quantitative calculations 
are extremely difficult, because atoms with more than one electron are complicated 
“many-particle” systems. 

However, using a few examples, we shall consider how many different terms are 
possible for a given electronic configuration and how these are arranged energetically. 

Our first example is an atom with two p electrons in unfilled shells, in the configura- 
tion {npY{n'p)^. If n = n\ the two electrons are equivalent and the configuration is 
np^. Carbon is a concrete example of this case. 

Figure 19.5 shows the terms discussed already in Sect. 19.2 which are possible if the 
two p electrons are coupled. First of all, the two spins can be either parallel or an- 
tiparallel. The position functions associated with these alignments differ with respect to 
Coulomb repulsion, so that according to Hund’s rule, the state with the spin quantum 
number S = 1 has a lower energy. Thus one obtains an 5 = 1 (triplet) and an S = 0 
(singlet) term scheme, with the triplet scheme having lower energies. In addition, the or- 
bital angular momenta /i = 1 and /2 = 1 can couple to give L = 2, 1 and 0. This pro- 
duces a a P and an S state. The state with the highest value of L has the lowest energy. 
This, like the other Hund rules, is not adequately understandable by intuition alone. 

For the triplet terms and the spin-orbit interaction leads to a further splitting 
into three states each. The singlet terms and the term are not subject to fine structure 
splitting, as we have seen earlier. Thus we obtain, in all, 10 spectral terms. In an 
external magnetic field, all those terms which have a total angular momentum J not 
equal to zero are further split into the m y states. The terms indicated by dashed lines in 
Fig. 19.5 are not possible for equivalent electrons (i.e. in the configuration) due to 
the Pauli principle. Since the quantum numbers n and / are the same for the two 
electrons in these configurations, they must differ in their m/or quantum numbers. 
In this case, the terms ^P, and would disobey the Pauli principle, and they are 
thus forbidden for equivalent electrons. For these terms, it is impossible to construct an 
antisymmetrical wavefunction for two p electrons with the same principal quantum 
number n. 




Fig. 19.5. Coupling of two p electrons. If the 
two electrons are equivalent {np^ configura- 
tion), the terms indicated by dashed lines are 
forbidden by the Pauli principle. The pres- 
ence of an external magnetic field lifts the J 
) degeneracy and leads to further splitting of 
the terms, which is not shown here 
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For two non-equivalent s electrons, i.e. for the configuration (nsy(n's)^, the only 
possible spectral terms are and ^Sq, For the configuration {nd)^(n’py, ^Pi, 

^^ 1 , 2,3 and ^p 2 , 3,4 are possible. It is left to the reader to confirm this. For heavy 
atoms, LS coupling is replaced by jj coupling. 

In the general case, the possible atomic terms for a given configuration of 
equivalent or nonequivalent electrons can be obtained in the following manner: only 
the electrons in open shells {n, 1) must be considered. Each electron, denoted by the in- 
dex /, is characterised by the four quantum numbers, Uj, //, mu, and m^i. These can 
be determined by applying a hypothetical magnetic field which is so strong that all the 
magnetic couplings are removed. According to Ehrenfest, a virtual, infinitely slow 
(adiabatic) change in the coupling conditions leaves the quantum numbers unchanged, 
and in particular, the number of terms remains constant in this case. 

In order to derive all the possible terms, we have to consider all the possible varia- 
tions of the couplings. The following rules hold: 

- For each value of L, we have for Mi = Imu the possible values L,L - 1, . . . -L 
for the component of the total orbital angular momentum parallel to the applied 
field; 

- Likewise, for each S, we have the possible values = S,S-1,...-S = 

- When the electrons are completely decoupled by a strong magnetic field, the in- 
dividual electrons are quantised according to = /,/- 1, . . . - / and = ±\, 

Using these rules, one can write down all the possible electronic configurations (taking 
the Pauli principle into account). Table 19.4 shows this for the case of an np^ con- 
figuration. One starts with the highest value of Mi, which corresponds to the highest 
possible L-value. In the case shown in the table, this is Mi = 2, with Ms = 0; the 
corresponding term is thus ^D. It also corresponds to four additional configurations. 
The next highest value of Mi is Mi = 1 , with the largest M 5 -value being Ms = 1 ; the 



Table 19.4. Possible Russell-Saunders terms for the electron configuration {np^). The table is intended only 
to demonstrate the counting of the possible orientations of and from the allowed combinations of 
the one-electron quantum numbers mj and (indicated by arrows). If one wishes to construct the wave- 
functions belonging to the quantum numbers, it is necessary to formulate singlet and triplet functions from 
the four possible arrangements, (ii, ti, it, ii) of two unpaired spins. More details are given in Sect. 19.4.2 



rrii - 1 


m/ = 0 


m/ = - 1 


Ml 


Ms 


Term symbol 


Ti 


ti 




2 


0 




Ti 


t 


i 


1 


0 




t 


ti 


i 


0 


0 




t 


i 


ti 


-1 


0 






ti 


ti 


-2 


0 




ti 


t 


t 


1 


1 




ti 


i 


t 


1 


0 




ti 


i 


i 


1 


-1 




t 


ti 


t 


0 


1 




i 


ti 


t 


0 


0 


3p 


i 


ti 


i 


0 


-1 




t 


t 


ti 


-1 


1 




i 


t 


ti 


-1 


0 




t 


i 


ti 


-1 


-1 




ti 




ti 


0 


0 






19.4 The Many-Electron Problem. Hartree-Fock Method 



335 



corresponding term contains all together 9 configurations. Finally, only one con- 
figuration remains, namely that corresponding to 'S. Therefore, four equivalent 
p-electrons yield the terms 'D, ^P, and ‘S, and no others, since only thus is a unique 
correspondence between terms and configurations achieved. 

All those terms which arise from allowed electronic configurations of the atom are 
given in the term schemes, for example those in Fig. 17.4, 5. Together with the selec- 
tion rules for optical transitions 



AJ=0,±\ [except for (/= 0) ^ (/= 0)] 

Amj= 0, ±\ [except for (w/= 0) = 0) where AJ = 0] 



zlS = 0 



for the atom 



AL = Q,±\ j 

Al= ± 1 for the electron changing its configuration 
Aj = 0,± \ for one of the electrons in jj coupling , 



in LS coupling 



the term schemes and the spectra of all atoms are understandable. 



19.4 The Many-Electron Problem. Hartree-Fock Method* 



19.4.1 The Two-Electron Problem 

In Chap. 10, we were able to solve exactly the hydrogen problem, in which only a single 
electron orbits the nucleus. Unfortunately, there is no exact solution for any other 
atom with more than one electron. In spite of this, it is possible, usually to a very good 
approximation, to calculate wavefunctions and energies. In order to demonstrate 
the problems which arise here, we shall consider an atom with two electrons, for 
example the helium atom or a multiply ionised atom in which only two electrons remain 
(Fig. 19.6). We distinguish the coordinates of the two electrons by using the subscripts 
7 = 1 or 7 = 2. If only one electron were present, the Hamiltonian would be 

2 2 

— — , (19.2) 

IrriQ Aneorj 



where the Laplace operator is defined by 






a" 



8jc; dyf 



■ + 




(19.3) 





336 



19. Structure of the Periodic System. Ground States of the Elements 



If we ignore the interactions between the electrons, in a classical treatment the 
energy of the whole system of two electrons is simply equal to the energy of the in- 
dividual electrons. The Hamiltonian of the whole system is thus equal to the sum of the 
Hamiltonians of the two electrons. If we make use of the translation rule of quantum 
theory, which says that the kinetic energy is always to be replaced by the Laplacian 

(19.3), multiplied by - h^/lrriQ, we obtain for the two-electron system the Hamiltonian 

(19.4) 

where the Hamiltonian (19.2) is used. Actually, however, there is a direct interaction 
between the electrons. The most important part of this is the Coulomb interaction 
energy, so that we must replace (19.4) by 

= , (19.5) 

47r^o'■l2 



where ri 2 is the distance between the two electrons. Our problem is to find an exact 
solution to the Schrodinger equation belonging to (19.5). As we remarked above, this is 
not possible in closed form. Therefore, we first consider the simpler problem in which 
the Coulomb interaction between the two electrons is left out and the Hamiltonian 

(19.4) is used in the Schrodinger equation. We make use of the fact that the single- 
particle Schrodinger equation with the Hamiltonian (19.2) has already been solved. In 
order to stay closer to reality, we include the spin of the electron and introduce the spin 
variable along with the position vector r^. As an abbreviation, we use the variable Rk^, 
which is defined as 

Rk = iTk, spin variable) = (r*, k) . (19.6) 



As we saw in Chap. 10, the wavefunction of the one-electron problem is charac- 
terised by the quantum numbers n, I, irii, and nis: The energy of this function 

is E„ I m, m- We now consider the Schrodinger equation for the total Hamiltonian 

(19.4):’ 

(19.7) 

where E^ is an abbreviation for Eiof Since the Hamiltonian ® (19.4) includes the two 
variables Ri and R 2 , the wavefunction must naturally depend on these variables: 
'P(Ri,R 2 ). In order to avoid writing out the quantum numbers explicitly in each of the 
following expressions, we introduce the abbreviation Q for the total set of quantum 
numbers, 

Q = (n,l,mi,ms) = (g,ms) . (19.8) 

The Schrodinger equation (19.7) is solved by the wavefunction 



•PiRuRi) = 'PqSRx) . 



(19.9) 



as can be immediately demonstrated by substitution. The ^^’s on the right side of the 
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equation are solutions for the one-particle Schrodinger equations. The total energy is, 
as can also be easily demonstrated, 

= + (19.10) 

where the energies E on the right side are the one-particle energies associated with the 
quantum numbers Q\ and Qi- It would thus appear that we have reproduced, with no 
limitations, the above-mentioned classical result, that the one-particle energies are 
simply additive, if we ignore the electron interactions. 

The solution (19.9) also appears to allow the case of Qx = Q 2 , which would mean 
that the two electrons would have exactly the same four quantum numbers. According 
to the Pauli principle, however, this case is excluded. Theoretical physicists have there- 
fore considered whether a wavefunction can be found which automatically excludes 
this case. To find it, we make use of the fact that not only (19.9), but also the wave- 
function 



'PQ,iRx)^QSRl) (19-11) 

satisfies the Schrodinger equation (19.7), and yields exactly the same energy (19.10) as 
(19.9). As we know, any linear combination of wavefunctions which have the same 
energy is also a solution of the Schrodinger equation and has the same energy. The 
linear combination which automatically fulfils the Pauli principle is a difference 
between (19.9) and (19.11), namely 






1 






(19.12) 



This wavefunction vanishes identically if Q\ = Q2- The factor l/]/2 serves to 
normalise the whole wavefunction. If we exchange the coordinates R\ and R 2 in 
(19.12), the wavefunction is obviously converted to its additive inverse, or in other 
words, the wavefunction is antisymmetric. This is now a formulation which makes pos- 
sible a statement of the Pauli principle even when there is interaction between the elec- 
trons: The wavefunction must be antisymmetric with respect to the coordinates R^ and 
/? 2 - It should be remembered that the variables Rj include the spin variables. 

Let us now investigate the role of the spin variables and of the corresponding spin 
wavefunctions. 

To this end, we separate the total set of quantum numbers Qj^ defined in (19.8) into 



Qk ~ ^s,k) > 



(19.13) 



as well as separating the variables as in (19.6). 

We now write the wavefunction Tg^{Rk) as the product of a wavefunction J//^^(ry^) 
for the orbital motion (space function) and a spin wavefunction 0m, ^(^) (spin func- 
tion). We then have instead of (19.12) 

1/2 



- Wq^l) 



(19.14) 
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For simplicity, we characterise the spin quantum numbers m^ = ± y by T (spin up) or i 
(spin down). We then find the following possibilities: 



1) 


Ws,l = t , 


'Ws.2= t 


(19.15) 


2) 


ms,\ = i , 


/«s,2= -i 


(19.16) 


3) 


"?s,l = t . 


'Ws.2 = i 


(19.17) 


4) 


"?s,l = J- . 


"?s,2 = t . 


(19.18) 



In the wavefunctions belonging to 1) and 2), we can factor out the spin functions and 
obtain 



1) !Ptt(^i,J?2) = —iWq^irx) H/q^{r2)-WqS''2) V/92(»*l))X0T(l)0r(2) 

1/2 



and 



2) !f'ii(/?i , Ri) = — {y/g^ (ri) y/g^{r 2 ) - {t 2 ) Wq^iTi)) x 0i(l) (p[{2) 

j/2 



(19.19) 



(19.20) 



The wavefunction for two electrons may be written as a product of the space function 
and the spin function: 



nR\ , Ri) = T 2 ) (1 , 2) , where 


(19.21) 


<^TT(1.2) = 0t(1)0t( 2) and 


(19.22) 


0u(l,2) = 0i(l)0i(2) . 


(19.23) 


Let us consider the precise meaning of the spin wavefunctions 0. Since they refer to 
two electrons, it seems reasonable of inquire into the total spin of the pair of electrons. 
The z component of the latter corresponds to the operator 


^z,\ + ^z,2 


(19.24) 


where is the operator for the z component of the spin of electron k. As one can 

quickly verify, the following relations hold: 


2^z0t(1)0t(2) = ^0t(1)0t( 2) and 


(19.25) 


Z,0i(\)0iC2)= -h(t>i{\)(t>i{2) , 


(19.26) 


i.e. the spin functions (19.22, 23) are eigenfunctions of (19.24). Likewise, 
fy (cf. Problem 19.8) that they are simultaneously eigenfunctions of 


one can veri- 




(19.27) 



with the eigenvalue 2h^. 
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Let us now return to the possible spin arrangements (19.15-18) and write down 
the remaining wavefunctions which correspond to (19.17, 18): 

3) nRuRi) = V^<,,(/-2)0T(l)0i(2)- v/,,(r2)v/^^(n)0T(2)0i(l)) (19.28) 



4) T{Rx,Ri) = ^{¥q^irx) V'92(''2)0i(1)0t( 2)- V'9,(''2) V/92(''i)0i(2)0T(l)) • 

j/2 (19.29) 

These may evidently not be written in the form of products (19.21). However, as we 
know, we may take linear combinations of wavefunctions which belong to the same en- 
ergy; in fact we must do so when applying perturbation theory to degenerate states. We 
thus take the sum of 3) and 4) and obtain after rearranging 

5) = 3) + 4) = ^ (V',j(/-i) - ¥qi (ri) ¥q 2 ^rx)) x (0t(l) 0i(2) + 0r(2) 0i(l)) 



= •Pn+iT(/?i,/?2) (19.30) 

and 

6) = 3)-4) = 1(1^9, (/-j) ¥q^(r2)+ ¥qS^2) ^'^^(O) X (0t( 1) 0i(2) ~ 0t(2) 0i(l)) 

2 

= 'Pn-n(Ri,R2) . 

These new wavefunctions thus indeed have the desired form (19.21) and are further- 
more eigenfunctions of 

^z(0T(l)0i(2)±0t(2)0i(l)) = O (19.32) 

z\<t>}{\) 0i(2) + 0 t( 2) 0i(l)) = 2 /i '(0T(1) 0i(2) + 0 t( 2) 0i(l)) (19.33) 

^'(0r(l) 0i(2) - 0 t( 2) 0i(l)) = 0 . (19.34) 

Clearly, the wavefunction 

«>=0T(l)0i(2)-0t(2)0l(l) (19.35) 

corresponds to a singlet state (S = 0), while the spin wave-functions 



0t(1)0t(2) 

0t(l)0i(2) + 0t(2)0i(l) 
^ 0i(l)0l(2) 



(19.36) 

(19.37) 

(19.38) 



correspond to a triplet state (S = 1) with = 1 >0, - 1 . 

The corresponding space functions, which are the wavefunctions for the orbital mo- 
tion, have differing symmetries with respect to the space coordinates in the singlet and 
the triplet states. 
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The triplet state has antisymmetric space functions; thus, the probability density for 
both electrons being in the same place vanishes, and it becomes small when the elec- 
trons approach each other. The positive Coulomb repulsion energy between the elec- 
trons is thus lower for the triplet state than for the singlet state, which has a space func- 
tion symmetric in the space coordinates. This forms the theoretical basis for Hund’s 
rules (see Sect. 19.2). 



19.4.2 Many Electrons Without Mutual Interactions 

The considerations which led us to the wave function (19.12) can be generalised. We 
shall give the results here without proof. In the general case, we are concerned with the 
variables i?i, . . ./?tv of electrons. In the absence of interactions among the electrons, 
the Hamiltonian is a sum of one-electron Hamiltonians: 

Jf0= (19.39) 

7 = 1 

The solution of the Schrodinger equation 

^^'P=Et^ (19.40) 

associated with (19.39) is naturally a function of the coordinates R \, . 

nRuR2,...RN)- (19.41) 

Retaining the notation introduced above, we can immediately show that the 
Schrodinger equation (19.40) is solved by the product 

nRuRi, . . Rn) = 'Pq,{Rx) 'Pq,{Ri) . . . 'Pq^(Rn) ■ (19.42) 



This approach is often called the Hartree method. The energy of the solution is given 
by 

Et = Eq^ + Eq^-\- . . .+Eq^, (19.43) 

where the energies on the right side are again the energies of the individual electrons. 

The solution (19.42) is not yet compatible with the Pauli principle, because it allows 
solutions Qi = for the pair /, k. The solution which is compatible with the Pauli 
principle is given by a determinant of the form 



nRuR2,...RN) 



1 

Vm 



!Pq7/?i) !Pq7/?2) ...'PqSRn) 
^Q,(Rx)^Q,iR2) ... 



'PQ^m^Q^{R2)...'PQ^{RN) 



(19.44) 



A determinant changes its sign if two rows or two columns are exchanged. If we 
exchange two variables /?/ and Rj^, this is equivalent to exchanging two columns in 
(19.44). Thus (19.44) guarantees the antisymmetry of the wavefunction. Since a deter- 
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minant is zero if any two rows or columns are identical, we see that (19.44) vanishes if 
Qi = Qk for any pair /, k. 

We shall now discuss a special case of (19.44), namely the case that all the electrons 
have parallel spins. Then the spin quantum number is e.g. 1/2 for all j. If we also 
ignore spin-orbit coupling, then the wavefunction of an individual electron can be 
written as a product (where we write mj^ for the quantum number rrii): 

^q^{Rj) = (19.45) 

where y/ refers to the orbital motion and 0 to the spin. As can be shown using elemen- 
tary rules for determinants, substitution of (19.44) in (19.45) yields 



W,.../?n) 










0t(l)...0t(^), (19.46) 



i.e. the total wavefunction is now a product of a spin function which is symmetric 
(because the spin quantum numbers of the electrons are identical) and an antisym- 
metric wavefunction, which is represented by a determinant as in (19.46) and depends 
only on the position variable rj. 

If we set the coordinates for a pair /, k equal, in the determinant, it will of 

course vanish. This means that two electrons with parallel spins cannot occupy the 
same position. Since the wavefunction y/ is continuous, so is W. This means that the 
probability of finding two electrons at the same position goes to zero continuously if 
the coordinates of the two become equal. The Pauli principle thus provides auto- 
matically for a certain distance between electrons with parallel spins. 



19.4.3 Coulomb Interaction of Electrons. Hartree and Hartree-Fock Methods 

We now turn to the actual problem, in which the Coulomb interaction among the 
electrons is taken into account. The energy of the Coulomb interaction between the 
pair of electrons y, k is given by e^/{4n8orji^) {tj^ is the distance between the two elec- 
trons), so the Hamiltonian is 

N 

1 1 . (19.47) 

j=\ j<k 4neorjic 

The summation rule j<k prevents the interaction energy between pairs of electrons 
being counted twice. If instead of using this rule, we sum over all indices j and k, with 
the limitation that y + /:, we must set a factor 1/2 in front of the interaction sum: 

N i ^2 

jf= I I . (19.48) 

Our next task is to solve the Schrodinger equation 






(19.49) 
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for the Hamiltonian (19.48). Since there is no exact solution we follow this train of 
thought: First, since the one-electron problem has already been solved, the wavefunc- 
tions of the individual electrons, in the form 



'Pq{Rj) (19.50) 

are already known. There is a charge density distribution associated with the wavefunc- 
tion (19.50), 



Q(rj) = e\'¥Q{Rj)\\ (19.51) 

As we know from electrostatics, there is an interaction energy between a charge at 
position r and the charge distribution given by (19.51). This energy is given by the 
product of the charge and the electrostatic potential. The latter can be calculated from 
the charge distribution q. In all, the Coulomb interaction energy is expressed by 



V(r) = 



1 

47160 






(19.52) 



where the integral covers the total volume (the volume element is indicated here by c?r, 
in order to distinguish it from the interaction energy V). If we substitute (19.51) in 
(19.52), we obtain 



V(r) = 



4neo \r-rj\ 



(19.53) 



The main idea of the Hartree and Hartree-Fock methods, which are discussed 
below, is to reduce the many-electron problem to a one-electron problem. Let us 
consider a single electron. It is moving not only in the field of the atomic nucleus, but 
also in the field of all the other electrons. The simplifying assumption is then made that 
the electron density distribution of all the other electrons can be given, as a first 
approximation, by one-electron wavefunctions (19.50). In order to calculate the wave- 
function of the chosen electron, we must solve a Schrodinger equation in which both 
the Coulomb potential of the nucleus and the interaction energy with all the other elec- 
trons appear. If the chosen electron has the subscript k, and thus the coordinate the 
Schrodinger equation is 






Ze^ 



+ Vf\r,) 



i,,t\Rk) = EwnRk) 



(1)/ 



(19.54) 



2/77o 47reo7‘* 

vP is the Coulomb interaction energy with all the other electrons, and is obtained from 



Vk(r)= I'j ' ^ drj 

7=1 47ieo\r-rj\ 



(19.55) 



by substituting, as a first approximation, the wavefunction i//^ for ij/qj in (19.55) \ The 



1 



The prime sign ( ') on the sum in (19.55) indicates that j^k 
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superscript (0) means that we use a given (or guessed) wavefunction to start the whole 
procedure. In particular, may be a wavefunction belonging to a given potential. 
Similarly, the superscript (1) indicates that this wavefunction for the electron k was 
obtained from (19.54) in the first step of an iterative process. In the second step, we 
now use the wavefunction for y/ in (19.55). This yields the new wavefunction y/^^^ 
in a Schrodinger equation analogous to (19.54). The process is repeated until there are 
no more significant changes in the wavefunctions y/^\ or in other words, until the 
method converges. Seen schematically, we have 

y{\)^ ^( 2 )_^ y{2)^ ^ ^ ^{j) ^ ^ ^ ( 19 . 56 ) 

The method described above is somewhat heuristic, of course. It is desirable to set it 
on a firm mathematical basis, which is possible, but we cannot go into the details here 
because of space limitations. It can be shown that the Schrodinger equation (19.49) 
may be solved by using a variational principle. According to this principle, the 
expression 

(19.57) 

must be equal to an extremum (maximum or minimum) with the secondary condition 
that the wavefunctions are normalised: 

|!P*¥^rfTi...rfTiv=l. (19.58) 

If we use a wavefunction !Pof the form (19.42), i.e. a product wavefunction in such a 
variational procedure, we find a set of Schrodinger equations for the individual wave- 
functions: 




Ze^ 



+ Vk{Tk) 



WQ^i^k) - E y/qjfik) , 



(19.59) 



where Vk(rj^) is defined by (19.55). The Hartree method just consists of solving these 
Schrodinger equations (19.59) iteratively in the way indicated above. 

The disadvantage of the Hartree method is obvious, in that it uses product trial 
functions for !P, which, as we know, violate the Pauli principle. The key to expanding 
the Hartree method to include the Pauli principle lies in utilising the determinant trial 
functions (19.44) for Win the variation equations (19.57) and (19.58). The correspond- 
ing calculations are rather long; we shall therefore simply give the result here. For the 
individual wavefunctions, we find a set of Schrodinger equations of the form 
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(19.60) 



The term in brackets on the left-hand side, and the right-hand side of (19.60) agree with 
the Hartree equation (19.59). The additional term containing the sum over j is new; it is 
referred to as an exchange term. The reason for this terminology is the following: if one 
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compares y/Qj^ from (19.60) with this exchange term, one recognises that the wave- 
function if/Qj^ has exchanged roles with the wavefunction y/Qj, since the electron coor- 
dinates Rk and Rj have been exchanged. This exchange term is a direct result of the 
determinant approach, i.e. of the antisymmetry of the wavefunctions. Intuitively ex- 
plained, it means that the Coulomb interaction energy between electrons with parallel 
spins is reduced relative to that between electrons with antiparallel spins. Since the 
interaction potentials V themselves depend on the wavefunctions y/, a solution to 
(19.60) is difficult to find. For a solution, one therefore resorts again to the Hartree- 
Fock method, according to which (19.60) is solved iteratively following the scheme 
(19.56). 



Problems 

19.1 How many electrons do those atoms have in which the following shells are filled 
in the ground state: 

a) the K and the L shells, the 35 subshell and half the 3/? subshell? 

b) the K, L and M shells and the 45, 4p and 4d subshells? 

What are the two elements in (a) and (b)? 

19.2 Show that for a closed nl shell, L = S = 0. 

19.3 a) Two equivalent p electrons have strong spin-orbit coupling. Calculate the 
possible values of the total angular momentum quantum number, if the coupling is 
purely jj. Remember to take the Pauli principle into account. 

b) Consider the same problem in the case of weak spin-orbit coupling, so that LS 
coupling of the two p electrons can be assumed. Do the same values of J occur with 
equal frequency in the two cases? 

Hint: Since the particles cannot be distinguished, configurations which are the same 
except for the exchange of electron indices may only be counted once. In case (a), the 
Pauli principle can be taken into account in this way, because the two electrons cannot 
have identical sets of quantum numbers. In case (b), the Pauli principle is taken into 
acount by the requirement that the wavefunction of the overall state must change its 
sign when the particle indices are exchanged (antisymmetry under particle exchange). 

19.4 Give the terms for the following configurations and indicate in each case which 
term has the lowest energy: 

(a) ns, (b) np\ (c) {np)\n's), (d) np\ (e) (nd^){n'p), (f) {nd){n'd). 

19.5 In a diagram, show the occupied electronic states of the Si, Cl and As atoms 
when the atoms are in the ground state configurations. 

In each case, write the electronic configuration and the terms of the ground state. 
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19.6 Give the ground state configuration and the number of unpaired electrons in the 
following atoms: 

a) S 

b) Ca 

c) Fe 

d) Br . 

19.7 Calculate the terms for the np^ configuration. Give the values of S, L and J for 
the terms 

‘So, ^Si/2, ‘Pi, ^P2, *D2and %n- 

Decide which of the terms correspond to the configuration. Use your result to de- 
ter mine the ground and first excited states of titanium. 

19.8 Demonstrate that the relations (19.25, 26), and (19.32) are correct. Show also 
that (19.35-38) are eigenfunctions of and confirm the eigenvalues given in the 
chapter. 

Hint' Write as ((Tj + ^ 2 )^ ~ ^2 2(Tj(T2. 

19.9 Is there a triplet state when the quantum numbers of the two electrons are identi- 
cal? Justify your answer. 

Hint: Consider the symmetry of the spatial wavefunction. 
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20.1 Influence of the Atomic Nucleus on Atomic Spectra 

The atomic nucleus generates the central field in which the electrons of the atom move. 
The nuclear charge number (atomic number) therefore appears in all the equations for 
spectral energy terms of atoms. In addition, there are other influences of the nucleus on 
the electronic spectra, the measurement of which generally requires an especially high 
spectral resolution. Therefore, one refers to a hyperfine structure of the spectral lines. 
By contrast, the effects of the interactions of the electrons among themselves as well as 
those of electronic spins with the orbital moments, which we have treated in the pre- 
ceding chapters, are referred to as fine structure. 

To measure the hyperfine structure, one needs spectral apparatus of the highest 
resolving power. Until a few years ago, this meant interference spectrometers with 
resolving powers of the order of 10^. In recent years, laser-spectroscopic methods have 
been developed, which allow still higher resolving powers to be attained. More details 
will be given in Chap. 22. 

The three most important influences of the nucleus on the electronic spectra, aside 
from the effect of the Coulomb field of the nuclear charge + Ze, are the following: 

- The motion of the nucleus around the common centre of gravity of the nucleus and 
the electrons leads (in the model for Bohr and Sommerfeld) to a dependence of the 
Rydberg constant on the nuclear mass. As a result, isotopes, that is, atoms with the 
same nuclear charge Z but with differing masses, have energy terms which differ 
somewhat. Their spectral lines thus are also different. This isotope effect, which we 
have already treated (Sect. 8.5), can, in the case of heavy elements, only be 
measured with a high spectral resolution, since the relative mass differences and 
thus the differences in the energy terms of different isotopes are then minimal. 

- The differing volumes of isotopic nuclei, having the same charge Z but different 
neutron numbers and therefore different masses, also lead to an isotope effect in 
the spectra, the so-called volume effect. Since the density of nuclear matter is 
constant, and therefore the nuclear volume is simply proportional to the number of 
nucleons in the nucleus, different isotopes of a particular chemical element have 
differing nuclear sizes. From this follows a somewhat differing interaction between 



, 168 cm 



y234 y235 y236 y238 



Fig. 20.1. Experimental example of the isotope effect: the isotopic splitting of the 
4244.4 A line in the uranium spectrum, observed in the 6th order in a 9m grating 
spectrograph. [After H. G. Kuhn: Atomic Spectra (Longmans, London 1962, 
Plate 18)] 
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the nucleus and the electrons. These two isotope effects permit a determination of 
the number, masses, and abundances of the isotopes of a given element. Figure 20.1 
shows an experimental example. 

- The term hyperfine structure in the precise sense refers, finally, to another kind of 
structure in atomic spactra, which results from the existence of spins and magnetic 
moments of the atomic nuclei. The presence of these properties in nuclei was first 
postulated by Pauli in 1924 as a means of explaining spectroscopic observations. In 
the year 1934, Schuler further postulated the existence of electric quadrupole 
moments in nuclei. The interactions of these nuclear moments with the electrons 
lead to an additional splitting of the spectral lines, namely the above-mentioned 
hyperfine structure. This will be the topic of the present chapter. 



20.2 Spins and Magnetic Moments of Atomic Nuclei 

Atomic nuclei possess a mechanical angular momentum 

\I\ = ]/l{I+\)h. ( 20 . 1 ) 

Here, the quantum number I may be integral or half-integral. Stable atomic nuclei are 
known with /-values between 0 and 1 5/2. One says, briefly, “The nucleus has a spin /” . 
This means that the largest measurable component of the nuclear angular momentum I 
has the value I h. As we have already seen in the case of electronic angular momenta, 
only the component in a special direction z, for example in the direction of an applied 
magnetic field B, is observable. The x and y components have time averages equal to 
zero. For the z component, we have 

{I)z=mjh with W/= /, /- 1, . . . -/. (20.2) 

There are thus 2/-h 1 possible orientations of the nuclear angular momentum relative to 
the special direction, corresponding to the possible values of the nuclear magnetic 
quantum number ruj; for example, for the hydrogen nucleus - the proton - we find 
/= |/| = (1^/2)^, I,= ±i-h (see also Fig. 20.2). 

A magnetic moment juj is connected with the nuclear angular momentum; the two 
are proportional: 



Pi=yl- (20.3 a) 

The constant of proportionality y is known as the gyromagnetic ratio. The unit of the 
nuclear magnetic moment is the nuclear magneton, ju^= eh/lm^, where mp is the 
proton mass. The nuclear magneton is thus smaller than the Bohr magneton by the 
mass ratio of the electron and the proton: 

Pn = /iB/1836 = 0.5050824 • 10“^^ Am^or JT“^ . 

With this definition, we may write the magnetic moment as 



h 



(20.3 b) 
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Fig. 20.2. The nuclear spin / (here for the case / = 2) has discrete 
orientations relative to a special direction, here an externally 
applied magnetic field Bq {left). Correspondingly, the measurable 
component of the magnetic moment {middle) and the potential 
energy {right) have discrete values 
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The nuclear g factor defined in (20.3 b) is a dimensionless number gj = yh/ju^ (cf. Sect. 
12.2). In contrast to the ^/factor for the electrons, the nuclear g factor cannot, as yet, 
be calculated from other quantum numbers. Otherwise, the relations between spins and 
magnetic moments in nuclei and in the electronic shells are completely analogous. 

Following the quantisation rules for the angular momentum, one can also only 
observe the component of the nuclear magnetic moment along a special direction z. 
From (20.2), this component is 



(Mi)z=^ y(I)z = 






(20.4 a) 



The maximum possible value for mjis /, Fig. 20.2. According to (20.2), the maximum 
observable value of jujis 

li^/lmax = • (20.4b) 

One thus says, simplifying, that the nucleus has the magnetic moment juj = gifi^L This 
is the experimental quantity which appears in tabulations. 

To clarify these concepts, a few further examples: For the hydrogen nucleus, the 
proton, we have 

+2.19n^-, I=\- gi=5.58; 

for the potassium nucleus with mass number 40, 

-1.29//n; 1=4; gr= -0.32. 

The sign of /i/ is also contained in the gj factor: gj>0 means that /i/ > 0 also, i.e. the 
directions of the spin /and of the moment ///are parallel. 

An example for a positive sign of ///is the proton. It has a positive charge; 
therefore, it is intuitively understandable that the factor connecting the angular 
momentum with the magnetic moment of the proton has a sign opposite to that in the 
case of the electron (Sect. 12.4). 
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There are also numerous nuclei with vanishing spins, 1=0. These nuclei do not con- 
tribute to the hyperfine structure. Examples of this type of nucleus are 

4tt„ 12p 16p. 40^„ 56t7« 88q_ 114^. ISOuf 208bu 238tt 

2 rte, 6 C, 8 U, 20'-a, 26^6, 3sbr, 48 Cd, 72 Ht, 82 “D, 92 U . 



A detailed theoretical description of moments, spins and ^-factors of the nucleus is 
in the province of nuclear physics and will not be gone into here. 

The quantities I or I can be calculated from experimental data, such as the number 
of the hyperfine components of spectral lines. The gyromagnetic ratio y, or the gj 
factor, is measured experimentally by nuclear magnetic resonance, which will be the 
subject of the next sections. 

Since the magnetic moments of nuclei are several orders of magnitude smaller than 
those of electrons, the interactions to be expected with external or internal fields are like- 
wise three orders of magnitude smaller; therefore, the name hyperfine structure is fitting. 



20.3 The Hyperfine Interaction 

We now wish to calculate the interaction energy between a nuclear magnetic moment 
and the magnetic field which the electrons produce at the site of the nucleus. 

In solving the analogous problem of the fine structure interaction between the spin 
of the orbital moments of electrons, we proceeded roughly as follows. 

The magnetic field Bi which is produced by the orbital motion of the electron inter- 
acts with the electron’s magnetic moment and orients its spins. The spin and the orbital 
angular momentum combine according to the rules in Sect. 12.7, 8 to yield a total 
angular momentum j. In an atom with several electrons and in LS coupling, the corre- 
sponding vectors L, S, and J are to be used. In the vector model, the angular momenta 
L and S precess about the space-fixed vector J (Fig. 20.3 a). An additional magnetic 
energy results from the interaction between the magnetic moment //jand the electronic 
field Bl (Fig. 20.4a): 

(20.5) 

where the indices FS stand for fine structure. 

The hyperfine interaction energy is calculated in an exactly analogous way. It is three 
orders of magnitude smaller than the fine structure energy because, this is the ratio of the 
magnetic moments of nuclei and electrons, respectively. Therefore the magnetic coupling 
of the atomic electrons among themselves is not affected by the hyperfine interaction. At 
the position of the nucleus, there is a magnetic field (Fig. 20.4b). This influences the 
magnetic moment of the nucleus and orients the nuclear spin. The result of this interac- 
tion is a coupling of the angular momenta of the electrons (/) and the nucleus (/) to a 
new total angular momentum F. In analogy to LS coupling for electrons (Fig. 20.3 b): 

F = J+I (20.6) 

where the absolute value of the total angular momentum IFI = \/F{F+ \)h. The quan- 
tum number F of the total angular momentum F can have the values 



F=J+I,J+I-\...J-I. 
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Fig. 20.3. a) Coupling of the S (spin) and L (orbital angular 
momentum) vectors to a total electron angular momentum vector J 
for the electron shell, b) Coupling of the J (electron angular 
momentum) and I (nuclear angular momentum) vectors to the total 
angular momentum F of the atom 





Fig. 20.4. a) Calculation of the spin-orbit interaction: the magnetic 
moment of the electron interacts with the magnetic field of the 
orbital motion b) Calculation of the hyperfine interaction: the 
magnetic moment of the nucleus interacts with the magnetic field 
of the electron shells B j 



This amounts to (27+ 1) or (27+ 1) possibilities, depending on whether I is smaller or 
larger than 7. 

In the vector model, the vectors I and 7 precess in quantised positions around the 
spatially fixed vector F, Fig. 20.3 b. The number of hyperfine levels is thus uniquely 
determined by the quantum numbers 7 and 7. If 7 is known, 7 can thus be measured 
if 7 is smaller than 7. 

The additional magnetic energy due to the hyperfine interaction is 

l^HFS = ~ Fr (20.1) 

where Bj is the magnetic field generated at the position of the nucleus by the electron 
shell (Fig. 20.4b). (HFS is the abbreviation for “hyperfine structure”.) 

Further calculation of the hyperfine interaction is quite analogous to calculation of 
the fine structure interaction. Each of the possible orientations of the nuclear spin in 
the field B j corresponds to a definite potential energy. According to (20.7), it is 



kkps — ~ cos (jt[,Bj) 

= gjfi^]/l(I+i)BjCOs(I,J) , (20.8) 

if we substitute 1) for |^/|, according to (20.3b). The reversed sign in the 

second line of (20.8) is a result of the fact that the vectors Bj and J are antiparallel. 
Furthermore, in the quantum mechanical formulation, 

(20.9) 

2|/7(7+l) 1/7(7+ 1) 

according to the law of cosines, as shown in Sect. 13.3.5. This can be written as 
F(F+ 1) = 7(7+ 1) + 7(7+ 1) + 21/7(7+1)1/7(7+1) cos(7,7) . 

With (20.9), one finally arrives at the energy of the hyperfine interaction, 

zI^hfs = - [^(^+ !)-/(/+ 1) - J(J+ 1)] , 

2 



( 20 . 10 ) 
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with 

l//(7+l) ' 

Here, we have again defined a measurable quantity, the hyperfine constant or inter- 
val constant a. It is not to be confused with the fine-structure constant (12.28), which is 
also denoted by a. Figure 20.5 shows an example of (20.10), the hyperfine splitting of 
a term with 7=3/2 and 7=3/2. 
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Fig. 20.5. Hyperfine splitting and the interval rule: hy- 
perfine splitting of a state with J = 3/2 and I = 3/2. The 
numerical values of the quantum number F and the de- 
gree of degeneracy 2F+ 1 (with respect to rup) are given 
on the right 



The field strength produced by the electrons at the nucleus cannot, in general, be 
measured independantly of the magnetic properties of the nucleus itself. In order to ex- 
tract from a measurement of the hyperfine splitting of spectral lines, one must know 
the nuclear moment or gj, as seen from (20.10). An exact calculation of Bjis possible 
only in simple particular cases, e.g. for the hydrogen atom, because the wavefunctions 
are then precisely known (see Chap. 10 and Sect. 12.10). 

In the case of ^-electrons, the magnetic field Bj is mainly due to the nonvanishing 
electron density at the nucleus. The magnetic moments of the electrons interact directly 
with the nuclear moment, which is distributed over the finite nuclear volume. This in- 
teraction is known as the (Fermi) contact interaction; it is isotropic. For an s electron 
CS\/2, J= 1/2), it may be calculated exactly following Fermi, thus for example for the 
ground states of alkali atoms. One obtains the following expression for the hyperfine 
constant a: 

« = , ( 20 . 11 ) 

where the symbol has been used for the electronic g'-factor. 

Here, y/iO) is the wavefunction of the electron at the nucleus. For an s electron in 
a hydrogen atom (cf. Chap. 10), we have | (^(0)/ = 1/nr^, with /"h being the radius 
of the first Bohr orbit. The numerical calculation gives a = 1420 MHz or 0.0475 cm“’ 
or 5.9- 10“® eV (see also Fig. 20.7). In general, for s electrons in hydrogen-like atoms, 
we have as a good approximation 

|v/(0)/=zV(7r«V|i) . 



For electrons with orbital angular momentum / > 0, i.e. for p, d,f, ... electrons, the 
probability density at the nucleus vanishes. Here, the field is calculated from the di- 
pole-dipole interaction between the electronic moment and the nuclear moment. The 
hyperfine constant a becomes anisotropic. 

Even for p electrons, one obtains values of which are an order of magnitude 
smaller. A term diagram of the hydrogen atom including the hyperfine interaction is 
shown in Fig. 20.6. 
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Fig. 20.6. Term diagram of the H 
atom including the hyperfine 
splitting. All fine structure terms 
(see Fig. 12.20) are doubly split 
by the interaction with the pro- 
ton. The magnitude of the split- 
ting is not drawn to scale here. In 
particular, the scale was enlarged 
on going from the Bohr energy 
levels to the fine structure and 
again in going to the hyperfine 
structure, in order to show the 
energy differences which become 
smaller and smaller from left to 
right. The scale increases from 
below to above in the case of the 
hyperfine splitting; the latter is 
largest in the ground state, n = \ . 
The numerical values of the hy- 
perfine splittings are given in 
MHz 



Bohr Dirac Lamb Hyperfine - 

structure 



For other atoms, the only possibility is to use approximate methods. In general, however, 

- Bj and thus Fhfs large when the electrons are close to the nucleus, i.e. for 

large Z, small n and small /.We have already mentioned in Sect. 8.6 that heavy 
atoms can be so highly ionised with the use of a high energy accelerator that they 
contain only a single 1 5 electron. In this way, for example, it was possible to obtain 
the ion In this case, the nuclear spin is /= 9/2, and for the state there 

are thus two possibilities, F = 5 and F = A. The “hyperfine” splitting is so large that 
the optical transition between the two F terms lies in the ultraviolet range near 
250 nm. 

- The number of hyperfine components split apart is the same for a given type of atom (i.e. 
for a fixed nuclear spin quantum number I) in all its excitation states, so long as />/. 

- For a vanishing nuclear spin (/ = 0) or vanishing electron angular momentum (/ = 0), 
Fhfs = so there is no hyperfine splitting. 

Table 20.1 lists some numerical values for Bj. 

The magnetic field BjdX the nucleus can be determined from measurements of the 
hyperfine splitting of spectral lines, if the nuclear moments are known. In many atoms, 
they can be obtained by measurements in the presence of external fields (see Chap. 13). 
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Table 20.1. Magnetic field Bj produced at the nucleus by the outermost electron, for various terms (after 
Segre, fields in Tesla) 





n 


^^1/2 


2p 

n/2 


2p 

^3/2 


Na 


3 


45 


4.2 


2.5 


K 


4 


63 


7.9 


4.6 


Rb 


5 


130 


16 


8.6 


Cs 


6 


210 


28 


13 



The magnitude of the hyperfine splitting of spectral lines is, for example 0.027 cm“ 
for the lowest ^S \/2 term of lithium and 0.3 cm“^ for the lowest ^S \/2 term of cesium. 

The measurement of the hyperfine splitting Fhfs of spectral terms is thus one 
method by which it is possible to calculate a product of the nuclear property {gj) and 
the electron shell property (Bj), One must still take into account that the observed spec- 
tral lines are transitions between two terms, which are in general both subject to hyper- 
fine splitting. The selection rule for optical transitions is AF = 0, ± 1. This will not be 
derived here. 

The evaluation of hyperfine spectra is simplified by the interval rule. It follows 
from (20.10) that the relative distance between the hyperfine terms for the quantum 
numbers F and F + 1 is 



AEf+x-AEf= a(F+ 1) . (20.12) 

The distance between two terms in a hyperfine structure multiplet is thus propor- 
tional to the larger of the two F values, and the spacings within the multiplet are in the 
ratio F: (F- 1) : (F- 2), etc. This is shown in Fig. 20.5. 

Another important matter is the sequence of the levels. If the nuclear moment is 
positive, so is the hyperfine constant a, so that Fhfs increases as F increases. This offers 
an experimental method of determining the sign of the nuclear moment ///. 

Finally, the intensities of the spectral lines composing a hyperfine multiplet are also 
characteristic. In the absence of an external magnetic field, the terms are still (2F+ 1)- 
fold degenerate with respect to m/r. (The degree of degeneracy is shown in Fig. 20.5.) 
Terms with different values of F therefore have different statistical weights, and their 
intensities are proportional to (2F+ 1). To be sure, it is generally the case that both of 
the levels between which the transition occurs have hyperfine structure. This com- 
plicates the pattern to be expected. 

In all, hyperfine spectra are characterised by very typical ratios of intensities and 
distances between the hyperfine components. 



20.4 Hyperfine Structure in the Ground State of the Hydrogen 
Atom, the Sodium Atom, and the Hydrogen-like Ion g 3 Bi 

Here, we shall consider the hyperfine splitting of atomic terms, which was treated in the 
previous section, in more detail. In the ground state of the hydrogen atom, the proton 
and the electron spins can only be oriented parallel or antiparallel to one another. This 
yields the two possible values for F: 1 or 0. The energy difference between these two 
configurations is (Fig. 20.7) 
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Fig. 20.7. Hyperfine structure of the hydrogen ground state. 
The spins of electrons and protons can be parallel or anti- 
parallel to each other. See also Sect. 12.1 



Zlv = 0.0475 cm ^ or 
Zlv= 1420 MHz or 
A = 21 cm . 

This energy difference can be determined either by direct absorption of high-frequency 
radiation of the appropriate frequency, or from the splitting of the spectral lines which 
end on the « = 1, / = 0 level, if sufficient resolution is available. 

Since the energy difference A v can be exactly calculated if the magnetic moments of 
the proton and electron are known (cf. Sect. 20.11), and since the moment of the 
proton is known to great precision from nuclear resonance measurements (Sect. 20.6), 
an exact measurement of Zl v in hydrogen showed, for the first time, the deviation of 
the g factor of the electron from the value g = 2. The measured value of zl v differed 
slightly from the value calculated with g = 2, and from the measurement, the g factor 
was calculated to be ^ = 2.0023. This and the measurement of the Lamb shift were 
instrumental in stimulating the development of quantum electrodynamics. 

In interstellar space, hydrogen is present in extremely low concentrations, but due 
to the immense volume of the universe, the total amount of it is vast. Therefore it is to 
be expected that radio-frequency radiation at 21 cm wavelength would be emitted and 
absorbed there in measurable amounts. This radiation was actually discovered in 1951 
with large radio telescopes. Since then, the 21 cm line of hydrogen has played an impor- 
tant part in radio-astronomy. For example, there is an especially large amount of this 
radiation in the region of the Milky Way. The motion of parts of the Milky Way 
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Fig. 20.8. Hyperfine splitting of the lowest terms of the sodium 
atom with the allowed transitions. The numerical values for the 
hyperfine splitting are given here in MHz 




Fig. 20.9. Three components of the hyperfine-split D 2 line of 
sodium (Fig. 20.7), measured by recording the resonance radiation 
of sodium atoms in an atomic beam irradiated by light from a nar- 
row-band, variable-frequency dye laser. The AF values for the 
ground and excited state are given. The curve on the left is 
calculated from the natural linewidth, the curve on the right was 
observed experimentally. The measured (15 MHz) and calculated 
natural linewidths are very close [from Lange et al.: Opt. Commun. 
8, 157 (1973)] 
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relative to the earth can be deduced from the Doppler shifts of this line, and analysis of 
this motion has confirmed that the Milky Way is a spiral galaxy. This is probably the 
most spectacular application of the study of hyperfine structure to astrophysics. 

The hyperfine splitting of three levels of the sodium atom, ^Sj/2, ^P\n and ^Pyi is 
shown in Fig. 20.8. The nuclear spin of the atom is / = 3/2. There are four values of the 
quantum number F for P3/2, F= 3, 2, 1 and 0; for /= 1/2, F = 2 or 1. Together with 
the selection rule for optical transitions, AF = 0, ±1, these values yield the spectral 
lines indicated in Fig. 20.8. Because the linewidth is finite, it is usually possible to see 
only two components of the line Di with A = 0.023 A, and two components of the D2 
line with A = 0.021 A. Figure 20.9 shows a modern spectrum with the highest available 
resolution, obtained by exciting resonance fluorescence in a beam of sodium atoms 
with a very narrow-band, variable frequency dye laser. The method is further discussed 
in Chap. 22. 

As already pointed out above, heavy-ion accelerators have made it possible to pro- 
duce highly-ionised ions even of very heavy atoms. When heavy ions are accelerated 
to energies in the range of several GeV, their strongly-bound inner electrons can be 
stripped off on passing through a thin foil. Thus, for example, 82-fold positively- 
charged ions of the atom g3Bi could be prepared. 

In such heavy ions, the hyperfine splittings are many times larger than in the H 
atom, as might be expected from Sect. 20.3. The wavelength for the transition between 
the F=5 and F=A hyperfine terms of the Is state of Bi^^"^ (nuclear spin / of 
^^^Bi = 9/2) is thus about 245 nm. This corresponds to roughly 40000 cm“^ in con- 
trast to the value of 0.0475 cm“^ for the F= i-^F = 0 transition in the H atom as 
shown in Fig. 20.7. See I. Klaft et al.: Phys. Rev. Lett. 73, 2425 (1994). 

20.5 Hyperfine Structure in an External Magnetic Field, 

Electron Spin Resonance 

An important tool for the study of the hyperfine interaction is the measurement of the 
splitting of the lines of the optical spectrum in the presence of an external magnetic 
field ^0- This external field adds to the internal field and the resulting term split- 
ting depends on the relative sizes of the two fields. If the external field is so small that 
the magnetic potential energy of the atom in it is small compared to the energetic 
separation of the hyperfine terms, one speaks of the Zeeman effect of the hyperfine 
structure. 

In this case, the coupling of the angular momentum vectors I and J to F remains 
intact. If the external field Bq is strong enough, this coupling is lifted and one speaks 
of the Paschen-Back effect of the hyperfine structure. The transition from the Zeeman 
to the Paschen-Back effect occurs in most cases at much smaller fields than in the case 




Fig. 20.10. Hyperfine structure in a magnetic field. The vector diagram 
illustrates the Zeeman effect: the total angular momentum F, composed of 
the angular momentum vectors J and /, possesses quantised orientations 
relative to an applied magnetic field Bq. Only the z component of F is ob- 
servable 
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of the fine structure, due to the small magnitude of hyperfine splitting. With respect 
to hyperfine structure, 0.1 T is usually already a “strong” field. 

The case of the hyperfine Zeeman effect is represented in Fig. 20.10. The angular 
momentum vectors I and / remain coupled. The resulting total angular momentum 
vector F precesses around The precession frequency of I and J about F is much 
higher than that of F about Bq. There are 2F-f 1 possibilities for the orientation of the 
vector F relative to which are given by the quantum numbers rrif. As usual, the 
relation \F\z=niFh with mf=F, holds. For optical transitions, the 

selection rules AF = 0, ± 1 and Amf= 0, ±1 apply. 

The size of the Zeeman splitting in the presence of the hyperfine interaction can be 
calculated from Fhfs= ~Mf ' Sq, quite analogously to the calculation of the Zeeman 
effect with fine structure in Sect. 13.3. In this case, jUfis equal to jll juj. We shall thus 
restrict ourselves to stating the result: the shift of the atomic terms due to the hyperfine 
splitting in a magnetic field is given by (weak field case) 

^Eiips = (20.13) 

with 

^ ^ F(F+1) + /(/+!) -7(7+1) ^ //N F(F+l) + 7(7+l)-7(/+l) 

2F(F+1) //B 2F(F+1) 

The second term can be neglected relative to the first term because of the factor 
= 1/1836. The term splitting in a weak field then yields 2F+ 1 equidistant com- 
ponents; see Fig. 20.11. 

If the applied field is increased in strength, the first effect is a decoupling of the 
angular momenta 7 and /. The LS coupling is stronger and remains in effect, since it is 
determined by the magnetic moments of electrons, while the IJ coupling results from 
an electronic moment and a much weaker nuclear moment. Therefore, a relatively 
small applied field suffices to break up the IJ coupling; the hyperfine Zeeman effect 
then is replaced by the hyperfine Paschen-Back effect. The angular momentum vector J 
of the electrons precesses about the z direction defined by the field ^o- The nuclear 
moment ///is primarily affected by the electronic fiddly, since this is, in general, much 
larger at the nuclear site than the applied field; the magnitude of 5/ is 10 to 100 T. The 




Fig. 20.11. Hyperfine structure of an S ^/2 state with /= 1, 
without applied field (I); in a weak applied field, corre- 
sponding to the Zeeman effect with hyperfine structure (II); 
in a strong applied field, the Paschen-Back effect with hy- 
perfine structure (III). In making the transition from a weak 
to a strong applied field, the Zeeman effect of the hyperfine 
structure thus becomes the hyperfine structure of the 
Zeeman effect 
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Fig. 20.12. Hyperfine structure in a magnetic field. Vector diagram of the 
Paschen-Back effect. A total angular momentum F is no longer defined 



vector I thus precesses about the direction of J. Since, however, the precession of J 
about Bq is much more rapid, the nuclear moment experiences a constant component 
offiyin the direction of5o- The rapidly oscillating components in the x and y direetions 
average to zero; the final result is that the nuclear angular momentum vector / is also 
oriented in the direction of The precession frequencies of I and J about the z 
direction differ, however. The corresponding vector diagram is shown in Fig. 20.12. 

The quantum number Fis no longer defined. The orientation quantum numbers m / 
and irijdxt valid for the vectors / and J. The splitting energy of the atomic terms is the 
sum of three components. One of these is the shell moment, gjn^B^rrij, which was 
treated before under the Zeeman effect without hyperfine structure (Sect. 13.3). Each 
of these levels is split into (27+ 1) hyperfine levels, corresponding to the possible values 
of mi'. m = -I. The quantum number I can thus be immediately deter- 

mined by counting, as is made clear in the right side of Fig. 20.11. The selection rules 
for optical transitions are Atrij = Q,±i and Arrii = 0. The magnitude of this second 
splitting can be easily calculated using the vector model (Fig. 20.12). To a first approx- 
imation, it is anijmj, where 




Fig. 20.13. Hyperfine splitting (sche- 
matic) for J= 3/2, /= 3/2 in a weak, 
medium, and strong field. The sum rule 
mp = mj+mj must be obeyed. This 
follows from the conservation of angular 
momentum. (After H. Kopfermann) 
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a= (20.14) 

l/7(7+l) 

is the hyperfine constant defined in (20.10). 

If the field is strong enough, the effect of the external field on the nucleus is no 
longer negligible compared to the field of the shells. Therefore the Zeeman energy of 
the nucleus, - is included in (20.15) as the third term on the right. 

In all, the splitting energy in a strong field is 

= gjjUBmjBo-\-amjmj- gjfi^rrijBo- (20.15) 

The region of transition between the limiting cases of strong and weak fields is usu- 
ally very difficult to calculate, and can only be approximated. Figure 20.13 shows sche- 
matically the complicated behaviour of the terms in the intermediate field range for a 
state with the quantum numbers J = 3/2 and 1= 1/2. A further remark concerning 
(20.15): it would be consistent to treat the electronic ^-factor ^y(and naturally also gs 
and gi, Sect. 12.4, 12.2, and 13.3.5) as negative, since the electron spins are directed 
opposite to the corresponding magnetic moments. In this case, a minus sign is to be in- 
serted in (20.15) in front of gj. 

In ordering the levels with different possible values of mj, one must be careful. 
If the electronic field Bjai the nucleus is larger than the external field Bq, the energies 
of the nuclear states depend on the orientation of the nucleus with respect to the elec- 
tron. However, the quantum numbers refer to the direction of the external field Bq, 
which is not the same as that of the electronic field for all states. This is illustrated by 
Fig. 20.14, which shows the splitting of the ground state of the hydrogen atom in a 
magnetic field (compare Fig. 20.7 and Eq. 20.15). From left to right, the Zeeman ener- 
gy of the electron in the field Bq, the hyperfine interaction energy between the electron 
and the proton, and the Zeeman energy of the proton are ordered according to decreas- 

Fig. 20.14. The hyperfine struc- 
ture of the hydrogen atom in a 
strong magnetic field Bq , and the 
hyperfine structure observable in 
electron spin resonance (ESR). 
From left to right, the three con- 
tributions to the splitting accord- 
ing to (20.15) are illustrated, 
along with the magnetic quantum 
numbers and the allowed (mag- 
netic) dipole transitions. At the 
lower left, the ESR spectrum of a 
free electron is shown; at the 
lower right, that of an electron 
bound to a proton (H atom). 
From the splitting in the ESR 
spectrum, one obtains the hyper- 
fine interaction constant a (in the 
limiting case of a strong applied 
field ^o)- The selection rules for 
magnetic dipole transitions are 
Am^ (or Arrij)=±\, Amj=0 
for ESR, and Arrij= ±1, Arrij 
(or Am^) - 0 for nuclear mag- 
netic resonance (NMR) 
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Fig. 20.15. The ESR spectrum of hydrogen atoms which are stored in inter- 
stitial sites in a KCl crystal. The two outer groups of lines belong to the 
transitions with Am^ = \ and mj= + 1/2 or = - 1/2 of the light hydro- 
gen isotope (mass number 1 , } HX having a splitting H) of 500 Gauss; cf. 
Fig. 20.14. Each of these transitions is split into 13 equidistant lines, due to 
what is termed the superhyperfine structure, i.e. the interaction with the 
nuclear moments of the four nearest-neighbor d“ anions (/= 3/2). The 
three inner groups of lines belong to the transitions with Am^ = \ and 
rrij=\,0, or -1 of heavy hydrogen (jH, deuterium, with 7=1 and 
Qj = 0.857). The splitting constant a(^H) is smaller by the ratio of the Qj 
factors than that of (see J. M. Spaeth: Phys. stat. sol. 34, 71 (1969). In 
the limiting case of a strong external field Bq, the hyperfine coupling 
constant a' measured by this method is identical to the isotropic hyperfine 
constant a as in (20.11) 



ing interaction energy. The arrows indicate the spin directions. We note again that for 
the electron, the spin and the magnetic moment are antiparallel to one another, but are 
parallel for the proton. 

Up to now, we have mainly treated the observation of the hyperfine splitting in the 
optical spectral region, i.e. with electric dipole transitions and the selection rules 
AF = 0, ± 1 and Amp = 0, ± 1. However, in Fig. 20.14 the magnetic dipole transitions 
which are observable by electron spin resonance (ESR) are indicated. For these, the 
magnetic field interacts directly with the magnetic moments, and different selection 
rules apply: Ami= ±1 orZlm^^ ±1.A single resonance frequency for the free elec- 
tron (at the left in Fig. 20.14) becomes two transitions in the hydrogen atom due to the 
hyperfine and Zeeman interaction with the proton. Their splitting corresponds to about 
0.05 T (500 G). The electron thus experiences a field of about 500 G from the proton. 
By comparison, the nucleus, according to (12.26), is subjected to a field of the order 

0.05 • = 30 T . 

/^proton 



The transition region between a weak and a strong applied field can only be calculated 
in closed form for the case F= /± 1/2 by using the Breit-Rabi formula. We shall not 
treat this region further here. 

The electron spin resonance of free hydrogen atoms is difficult to observe in the gas 
phase due to the thermal motion of the atoms and their low density. However, hydro- 
gen atoms may be stored in alkali-halide crystals in interstitial lattice sites; using such 
crystals, one does, in fact, obtain the ESR spectrum expected from Fig. 20.14. An ex- 
ample is given in Fig. 20.15. It also shows the ESR spectrum of heavy hydrogen (deute- 
rium), iH. Due to the nuclear spin /= 1, one here observes three ESR transitions; be- 
cause of the smaller ^/factor (gi= 0.857), a and thus the hyperfine splitting are smaller. 
Figure 20.15 exhibits a further splitting, the so-called superhyperfine structure. This is 
due to the interaction of the electronic spin with the nuclei of host-crystal atoms on ad- 
jacent lattice sites. Further details are given in the figure caption. 
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20.6 Direct Measurements of Nuclear Spins and Magnetic Moments, 
Nuclear Magnetic Resonance 

We have seen in the previous sections that the examination of the hyperfine structure of 
lines of optical spectra makes possible the calculation of nuclear spin. However, the 
magnetic moment can only be measured in connection with the electronic field at the 
nucleus by this method. Since the electronic field cannot be measured independently of 
the nuclear spin, other methods are required to determine the magnetic moment. 

The Stern-Gerlach method (see Sect. 12.6) of measuring magnetic moments by 
deflection in an inhomogeneous magnetic field can be used to measure the nuclear 
moment if the electron shell of the atom is diamagnetic. Otherwise the deflection due to 
the nucleus will be completely obscured by the approximately thousand-fold greater 
effect of the electron-shell moment. In 1933, Stern, Frisch and Estermann used this 
method to obtain the correct order of magnitude of the proton magnetic moment in 
hydrogen molecules. They were not able to make a precise measurement, however. In 
their apparatus, the molecules had a flight path of 150 cm, and a deflection of 10"^ 
mm. The inhomogeneity of the field was 80000 G/cm or 800 T/m. 

The atomic or molecular beam resonance method of Rabi (1937) is much more 
precise. Here, as in electron spin resonance (Sect. 13.2), one measures the Larmor fre- 
quency of the nuclear spin in an external magnetic field, or in other words, the ratio 
jjLj/I. For this method, too, the field due to electronic shells in general must vanish at 
the nucleus. This condition is fulfilled for some atoms, including Hg, Cu, C and S, and 
for some molecules, e.g. H 2 O, CaO, LiCl, CO 2 , H 2 and NH 3 , because in these atoms 
and molecules, the magnetic moments of the electronic shells add up to zero. 

For the potential energy of the nuclear moment in the field Bq, 

V= -Mr Bo 

oCOS (I, Bo) 

= (20.16) 

The energy difference between two neighbouring orientations in the field Bq, i.e. for 
Amj= ±1 is then (Fig. 20.2) 



AE = QiMn^o • 



(20.17) 



If the sample is irradiated with radiation at a frequency corresponding to this energy 
difference. 



v = ^Bo 



QJl = 



h 

h 



or 



^0= y^o> 



(20.18) 



i.e. the Larmor frequency of the nucleus in the field Bq, one induces transitions with 
Ami= ±1 and the nuclear spin changes its orientation. If numerical values are sub- 
stituted, the resonance condition is 



v[Hz] =762.3 Bo [G] . 



(20.19) 
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Fig. 20.16. Atomic beam resonance according to Rabi. The mag- 
netic fields A and B are inhomogeneous, while magnet C has a 
homogeneous field Bq . A high-frequency field B^ is applied perpen- 
dicular to Bq (S, source; D, detector) 



Fig. 20.17. Data curve from the Rabi atomic beam resonance ex- 
periment. The intensity at the detector is at a minimum (for a con- 
stant frequency of the field B ^ ) when the homogeneous field Bq of 
magnet C fulfils the resonance condition. The curve shown here is 
for fluorine nuclei 



Resonance is detected in the molecular beam experiments by the deflection of the 
atoms or molecules in inhomogeneous magnetic fields. The apparatus is represented 
schematically in Fig. 20.16. The apparatus is adjusted so that when the resonance field 
is not turned on, the particle beam passes through the three magnets A, B and C and 
is detected at the detector D. The deflection in the inhomogeneous field of magnet A is 
thus exactly compensated by deflection in magnet B. However, if the resonance field 
within the homogeneous magnet C (S, is perpendicular to Bq) causes some nuclei to flip 
from one orientation to another, then the displacement in the field of magnet B is no 
longer symmetric to that in the field of magnet A. The affected particles thus are no 
longer picked up by the detector D. The current in the detector D is measured as a func- 
tion of the frequency of the field B\, or if B\ has a constant frequency, then as a func- 
tion of the field strength of the magnet C. When the resonance condition is fulfilled, 
the current in the detector is a minimum. Fig. 20.17. To calculate the nuclear moment 
fij itself, one must have the nuclear spin I from other measurements, most often from 
the hyperfine lines in the optical spectrum (Sect. 20.3). 

The Rabi atomic or molecular beam resonance method permits the measurement of 
ratios of resonance frequencies to great precision. The exact value of the field Bq does 
not have to be known. The ratio of the resonance frequencies of the electron and the 
proton, for example, was measured by letting particles with pure electron spin proper- 
ties pass through the same apparatus as the molecular beam with pure nuclear spin 
properties. The ratio of the moments 



y»ei 



9'electron /fi^proton 



could then be determined. Because the g factor of the electron is very precisely known 
from other experiments, it is possible to measure that of the proton, and from it the 
proton magnetic moment, very precisely. For protons, Rabi obtained the moment 
bi= (2.875 ± 0.02) This value can now be used as a secondary standard for other 
nuclei or for the exact measurement of magnetic fields Bq. 
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Fig. 20.18. Diagram of a simple 
nuclear spin resonance appara- 
tus. The sample is placed in a 
glass tube between the poles of a 
homogeneous magnet. The high- 
frequency field is applied 
across a bridge and an induction 
coil. For better detection of reso- 
nance, the ^0 can also be 
modulated by an additional coil 



In 1946, Purcell and Bloch showed both experimentally and theoretically that the 
precessional motion of the nuclear spin is largely independent of the translational and 
rotational motion of the nucleus, and that the method of nuclear spin resonance can be 
applied not only to free atoms, but to atomic nuclei in liquids and solids. 

The principle of nuclear magnetic resonance in the condensed phase is explained in 
Fig. 20.18. With the applied magnetic field Bq fixed, one varies the frequency of a sig- 
nal generator which produces the radio-frequency field B \ . When the resonance condi- 
tion (20.19) is fulfilled, i.e. when the frequency of the generator is identical to that of 
the nuclear Larmor precession, energy is absorbed from the radio-frequency field by 
the sample: this energy is taken from the induction coil containing the sample {Purcell), 
A power meter, used as receiver, indicates this energy absorption. The resonance may 
also be detected by observing a resonance signal induced in a second coil placed perpen- 
dicularly to the first {Bloch). Often, a fixed frequency is used, and the magnetic field is 
varied in order to sweep through the resonance. 

For protons, with Qj= 5.58 (Sect. 20.2), the resonance frequency in a field Bq= IT 
is calculated from (20.18) or (20.19) to be v = 42.576 MHz. This corresponds to a quan- 
tum of energy equal to zl^* = /?v = 1 .8 • 10 eV. Because of this small energy splitting, 
all of the Zeeman levels of the nuclear spin system are nearly equally populated accord- 
ing to the Boltzmann distribution at room temperature in thermal equilibrium. Thus, in 
resonance, nearly the same number of absorption and emission transitions are induced, 
and only a small nett effect remains. At room temperature, the population difference 
between two nuclear Zeeman levels in a field of IT is about 10~^ of the populations 
themselves. This extremely small population difference is responsible for the nuclear 
resonance signal. In order to obtain high sensitivities, at present the strongest possible 
magnetic fields are employed, using superconducting coils; e.g. 12 T, giving 
V = 500 MHz for a proton spin. 

Relaxation processes (compare Sect. 13.3.7) re-establish the population differences; 
otherwise, the resonance would quickly become saturated, i.e. the levels would all be 
equally populated and no further signal would be observable. 

In modern nuclear magnetic resonance spectrometers, the stationary or continuous 
wave methods described above are no longer used; instead, pulse methods, as described 
in Sect. 14.4 and 15.4, are employed. In these methods, one observes the time evolution 
of the (nuclear) magnetisation I{t) following a short, intense high frequency pulse, and 
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Fig. 20.19. Pulsed nuclear magnetic resonance. (Upper 
part) High-frequency pulses of frequency cuq are used to 
excite the resonance, e.g. 90° pulses. The measurement is 
repeated many times and summed (signal averaging). 
(Middle part) The time evolution I(t) of the>^ component of 
the nuclear magnetisation. My, is observed. The envelope 
decays with the time constant 7^ when, as here assumed, 
only one species of equivalent nuclear spins (e.g. protons) 
are present in the sample. This time constant, which we have 
denoted by 7^ for simplicity, is more precisely called T*, 
since it may contain 7] in addition to the transverse relaxa- 
tion time Ti defined in Sect. 14.5. (Lower part) Through 
Fourier transformation of the observed time spectrum I(t), 
one obtains the nuclear magnetic resonance spectrum I(co), 
which is a single line in the case of one equivalent species of 
nuclear spins 



calculates the frequency dependence of the magnetisation, /(a»), from the observed 
temporal behavior. 

This is illustrated in Fig. 20.19 for a simple case. After a 90° pulse (cf. Sect. 14.4), 
which turns the magnetisation away from the z direction and into the xy plane, the y 
component of the magnetisation exhibits an exponential decay with the time constant 
T 2 . The Fourier transform of this decay is given by 

4 +00 

^(0 = — 1 
2n _oo 

and represents the nuclear resonance spectrum /(co). If several inequivalent types of 
nuclei are present, the time evolution I{t) becomes more complex. 

20.7 Applications of Nuclear Magnetic Resonance 

The precision with which one can determine NMR resonance frequencies is often limit- 
ed by precise knowledge of the magnetic field Bq. The magnetic moment of the proton 
is now known to very high precision, so that, conversely, NMR can be used for very 
precise magnetic field measurements. Accuracies of better than 10 are attainable 
with this method. 

There are numerous other applications of NMR. In nuclear physics, it is used to de- 
termine gyromagnetic ratios and thus, when the nuclear spin / is known, magnetic mo- 
ments Pi. The most important applications of NMR are, however, in chemistry. More 
details on this topic are given in H. Haken and H.C. Wolf: Molecular Physics and 
Elements of Quantum Chemistry, Chap. 18. 

When an atom is chemically bonded, the influence of Bq on the electronic shells of 
the bonding partners causes a characteristic shift of the NMR frequency of its nucleus. 
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Fig. 20.20. Nuclear magnetic resonance (NMR) of ethyl 
alcohol, C 2 H 5 OH. The signal intensity at the receiver is plotted 
against the transition frequency. The latter is measured in ppm 
(parts per million) relative to the frequency of a suitable stan- 
dard. In the upper portion of the figure, one sees three lines 
having the intensity (area) ratio 1:2:3. They are due to proton 
spins in the -OH, the -CH 2 , and the -CH 3 groups, respec- 
tively, having 1, 2, and 3 protons. The resonance frequencies of 
these protons differ by a few ppm due to the differing chemical 
bonding in the three groups. In the lower portion, the same 
spectrum, but with higher experimental resolution, is shown. 
The - CH 2 signal is now split into a quartet of lines by the inter- 
action with the - CH 3 protons, while the signal from the latter 
is split into a triplet by the indirect nuclear spin interaction with 
the -CH 2 protons. The protons of the -OH groups are 
rapidly exchanged between different molecules, which averages 
out the indirect spin interactions and leaves a single resonance 
line 



The electrons, depending on the bonding, shield the applied field to varying degrees. 
The change in the resonance frequency due to this diamagnetic shielding is called the 
chemical shift. It is measured in ppm (parts per million). Thus, two resonance lines are 
shifted by 1 ppm in a spectrometer with ^0 = 1 T and v = 42.5 MHz if their frequencies 
differ by 42.5 Hz. 

As an example. Fig. 20.20 shows the proton NMR signal from ethanol. Even at low 
resolution (upper portion), one obtains three resonance maxima, since there are three 
different chemical shifts corresponding to protons bonded in the -CH 3 , -CH 2 , and 
— OH groups of C 2 H 5 OH. The areas under the resonance curves are proportional to 
the number of nuclei at that resonance frequency. With extremely homogeneous mag- 
netic fields and well-defined high-frequency signals, a considerably higher resolution 
can be attained (lower part of Fig. 20.20). One then observes that the three resonance 
lines are split into several resolved sublines. This structure is a result of indirect nuclear- 
nuclear interactions (coupled through the electrons) among all the protons in the mole- 
cule (called / coupling). This splitting will not be further treated here; it contains infor- 
mation about the structure and the bonding in the molecule under study. 

NMR is thus of great importance for the elucidation of structures in chemistry and 
in solid state physics. Other applications are found for example in the investigation of 
motion in liquids and solids, in reaction kinetics, and in analytical chemistry. 

In recent years it has become possible to obtain images of the interior of a (non- 
metallic) body using NMR. One thus obtains information about the spatial distribution 
of certain atomic nuclei, e.g. protons. Furthermore, spatial inhomogeneities in the nu- 
clear spin relaxation times which may be present can also be determined. This has 
opened a promising field in diagnostic medicine and in-vivo NMR of biological sys- 
tems. Under the name Magnetic Resonance Tomography, this method has recently 
come to be applied in practical medicine, where it complements and enlarges the possi- 
bilities of x-ray diagnostic methods. 
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Spatially resolved NMR is based upon (20.18, 19), according to which the resonance 
frequency cu of a particular nuclear species is a unique function of the applied magnetic 
field ^ 0 - To obtain an image of spatial structures, one requires NMR signals which 
contain spatial information in coded form. For this purpose, magnetic field gradients 
are used; this is illustrated by the following discussion, using proton NMR as an exam- 
ple. 

In a homogeneous magnetic field Bq, the resonance condition co = yB^ is fulfilled 
for each proton at the same frequency cu, independently of the location of that proton 
within the sample. The NMR signal therefore contains no spatial information. How- 






Fig. 20.21 a -c. Nuclear resonance of three samples with the same nuclei, e.g. protons, in a field gradient; 
illustrating position-resolved nuclear magnetic resonance, (a) In a field gradient dB/dx, each position jcj , Xi , 
and JC3 corresponds to a different field strength 8 ^, 82 , and 8 ^ . Samples 7, 2, and 3 (with 2 larger than 1 and 
3) are thus in regions of differing field strength, (b) Through the resonance condition co = yBq , each position 
x^.Xi, or JC3 corresponds to a particular resonance frequency cuj , cu 2 , or CU3 . (c) The intensity of the NMR 
signal is a measure of the number of nuclei with the given resonance frequency (cui , cu 2 , <^3 ). A unique corre- 
spondence between the measured spectrum and the spatial arrangement of the nuclei exists 

Fig. 20.22. The principle of position-resolved NMR. {Upper part) A cylinder W filled with water, in an in- 
homogeneous magnetic field Bq , and surrounded by input and receiver coils, HF. {Centre part) The free in- 
duction decay (FID) of the protons in the cylinder W following a high-frequency pulse. {Lower part) The 
Fourier transform of the FID signal I{t) yields the spectrum 7(ca). When the proton density in the cylinder is 
constant, one thus obtains information about the geometry of the sample, here a section along the length of 
the cylinder 
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ever, when a field gradient is superposed on the homogeneous field Bq, so that the field 
strength and thus the resonance frequency varies from place to place (Fig. 20.21, upper 
part), then protons at different locations will have different resonance frequencies (Fig. 

20.21 , centre). The spatial information is translated into a frequency scale, and the pro- 
ton NMR signal contains information about the location of the protons (Fig. 20.21, 
lower part). An image of the spatial structure becomes possible if one knows the rela- 
tionship between the field strength B and the location within the sample, and measures 
the signal amplitude as a function of frequency. 

If, for example, a cylinder filled with water is placed in a field gradient dB/dx (Fig. 

20.22, upper part), then the resonance frequency of the protons in layers perpendicular 
to X increases with increasing jc from left to right. It has proved expedient in practice to 
measure not the resonance signal as a function of frequency, /(cu), but rather the time 
evolution of the signal following a 90° pulse, i.e. the free induction signal (compare 
Sect. 15.4). The measured function I{t) (Fig. 20.22, centre) can be converted by Fourier 
transformation to the desired function, /(cu): 



+00 



—00 



The result is indicated in Fig. 20.22 (lower part). The observed free-induction decay 
following a 90° pulse contains the information about the time behaviour of the proton 
spins with frequencies in the range a>±Aco fulfilling the resonance condition (20.18), 
where A oj is the frequency spread of the high frequency field B\ of centre frequency a>. 
The amplitude is proportional to the proton density. If, as in Fig. 20.22, the proton 
density is constant, the shape of the cylinder, projected onto the plane perpendicular to 
the X direction, will be imaged. 




Fig. 20.23. Magnetic resonance tomography of 
a human head. (Kindly placed at our disposal 
by Prof. K. H. Hausser, Heidelberg) 
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In order to investigate the spatial structures of objects with more complex shapes 
and inhomogeneous proton densities, one requires many projections in different direc- 
tions. From these, the desired image can be reconstructed with the aid of a computer. 
The principle of the measurement can thus be simply understood; in practice, some- 
what different measurement procedures have been adopted. We shall not treat these 
here. 

In biological samples, in general not only the densities of particular nuclei vary 
from place to place, but also their relaxation times and T 2 (see Sect. 14.5). They de- 
pend, for example, on the local temperature and the local state of motion of the struc- 
tures being studied. By introducing the relaxation times as an additional indicator in 
position-resolved NMR and by using pulse methods for the measurement, it is possible 
to obtain images with strong contrast even from organs which show little contrast in 
x-ray images, e.g. in soft tissues. 

An example from diagnostic medicine is shown in Fig. 20.23. 

A further important modern application of resonance techniques is the caesium 
atomic clock, used as a time and frequency standard. ^^^Cs is the only stable isotope of 
caesium. It has a nuclear spin / = 7/2, and thus in the atomic ground state with 7=1/2 
the total angular momenta F = 4 and F= 3. 

With the help of an atomic beam resonance apparatus (see Fig. 20.16), one observes 
transitions between the hyperfine components F= 4, mp=0 and F= 3, mf= 0 in a 
weak applied field Bq, Fig. 20.24. The transition frequency is practically independent 

of^o. 

All the essential components of an atomic clock are similar to those of an atomic 
beam resonance apparatus following Rabi (cf. Sect. 20.6): a furnace which generates a 
Cs atomic beam; a polarisation magnet which separates the atoms in the F = 3 state 
from those in the F = 4 state; a resonator, in which atoms are made to undergo transi- 
tions from the F= 4, mf=0 state to the F= 3, mp = 0 state by induced emission, 
caused by irradiation at the resonance frequency of 9.192631770 GHz; a second magnet 
which serves as analyser and passes only atoms in the F= 4, m/r= 0 state; and a detec- 
tor. On resonance, one observes a minimum at the detector. 




Bo 



Fig. 20.24. A portion of the term scheme of the Cs atom 
in the ground state as a function of a weak applied mag- 
netic field Bq . The transition frequency used for the Cs 
atomic clock corresponds to the transition between the 
states F = 3, rrip = 0, and F=4, mp=0 
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The frequency v which is fed into the resonator is stabilised to the minimum in the 
detector. This frequency may be reproduced with an accuracy of ca. 10"^^. The most 
precise atomic clocks, which are used as primary frequency and time standards, are to 
be found in the laboratories of the Physikalisch-Technische Bundesanstalt in Braun- 
schweig, F. R. Germany; of the National Bureau of Standards in the USA, and of the 
National Research Council in Canada. 



20.8 The Nuclear Electric Quadrupole Moment 

In the year 1935 Schuler and Schmidt found, in studying the hyperfine structure of the 
optical spectra of the isotopes ^^^Eu and ^^^Eu, that this structure cannot be explained 
through the magnetic interaction between the nuclear dipole moment and the inner and 
applied magnetic fields alone. They found deviations from the interval rule (20.12) 
which are caused by an electrostatic interaction between the non-spherically symmetric 
atomic nucleus and inner electric fields, more precisely by the interaction between the 
nuclear electric quadrupole moment and the electric field of the electronic shells. 

Up to now, we have regarded the nucleus as spherically symmetric. If that is the 
case, it is surrounded by a spherically symmetric force field. The next less simple, 
possible charge distribution is that of an electric dipole; a static electric dipole is, how- 
ever, not found in nuclei (due to conservation of parity in the nuclear states). 



Fig. 20.25. Charge distributions in nuclei. If the nucleus is spherically 
symmetric, Q = 0. In the case of a cigar-shaped (prolate) nuclear 
charge distribution, Q is positive; for a pincushion shape (oblate), Q 
is negative. As a rule, large nuclear quadrupole moments are positive. 
The symmetry axis of the distribution is the direction of the nuclear 
spin vector / 

In contrast, there are many nuclei - in particular, all nuclei with spins 1 - 
whose form is clearly not spherical, but rather is an ellipsoid of rotation (Fig. 20.25). 
The electric potential of a homogeneously charged ellipsoid of rotation is no longer 
spherically symmetric; it is, instead, described by an electric quadrupole moment (in 
a first approximation). 

The electric quadrupole moment Qq of an arbitrary axially symmetric charge Qq 
distribution g{r) is defined as integral over the volume r 

eQo = \(3i:^-r^)Qir)dr , (20.20) 

T 

where r is the distance from the centre of the charge distribution and C is the coordinate 
in the direction of the axis of rotational symmetry. If such a charge distribution 
is brought into an inhomogeneous electric field, described by the potential V 
(E = -gradF), the energy levels are shifted as a result of the electrostatic inter- 
action by 

AEq = -6Qo^ f-cos^ e-i/l) , 

4 dz^ \2 / 




( 20 . 21 ) 
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where 6 is the angle between the z axis of the potential (likewise assumed to be axially 
symmetric) and the rotational-symmetry axis of the charge distribution. 

The quadrupole moment obviously vanishes for a spherically symmetric charge dis- 
tribution. Q>0 corresponds to a prolate (cigar-shaped) ellipsoid and Q<0 to an oblate 
(pincushion-shaped) ellipsoid (Fig. 20.25). Within the nucleus, the charge distribution 
is assumed to be homogeneous. The dimension of the quadrupole moment Q in nuclear 
physics is that of an area (cm^ or barn), see (20.20). 

In order to use (20.21) in the case of a free atom, we must take account of the fact 
that in this case, the rotational-symmetry axes of the quadrupole moment and of the 
potential are collinear with / and with J, respectively. The spectroscopically observed 
quadrupole moment Q is thus not equal to the value Qo from (20.20), which 
characterises the deviation of the nuclear charge distribution from spherical symmetry. 
Owing to the precession of the nucleus, a time-averaged effective quadrupole moment 
Q is instead observed; it has the value 



Q = Qo 



1 ( 21 + 1 ) 
(J+l)(2J+l) 



( 20 . 22 ) 



As a result, Q = 0 for 1=0 or 7= 1/2. Numerical values of Qq lie in the range 
1000- 10“^® cm^ for the nucleus 0.28- 10“^® cm^ for ^H, 0 for *H and '®0, and 
— 19.5- 10”^® cm^ for ®^Cu. Therefore, the angle 9 can be expressed in the vector 
model, again with the help of the law of cosines, analogously to the considerations we 
have used in the cases of fine structure and hyperfine structure. The result is 



AE - cQ (3/4)C(C+ !)-/(/+ 1)7(7+ 1) 
® ^ dz^ 2/(27- 1)7(27-1) 



(20.23) 



with the abbreviation C = F(F+ !)-/(/+ !)-/(/+ 1). The hyperfine terms are 
shifted further (Fig. 20.26) by AEq and the interval rule is therefore violated. 



F 

5/2 




+ D/4 



-D 
+ 5D/4 



Fig. 20.26. Quadrupole shift of the hyperfine levels for a 
state with J - \ and / = 3/2. The shift indicated corre- 
sponds to a positive quadrupole moment. The constant D is 
equal to the factor {eQ){b^ V/bz^) in (20.23). The expected 
splitting pattern, given by the interval rule, is modified by 
the quadrupole interaction 



A quantitative treatment of quadrupole shifts is difficult, for the most part. To 
determine the quadrupole moments of nuclei, one needs to know the field gradient 
6^F/0z^. Only rather large field gradients, as are found in molecules or in solids (but 
which are larger than can be produced in the laboratory), yield term shifts which are 
sufficiently large to allow an exact measurement. More precise determinations of 
nuclear quadrupole moments may be obtained from the direct measurement of nuclear 
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quadrupole resonance in molecules and in solids, by scattering of charged particles 
from nuclei (Coulomb excitation), by high-frequency resonance in an atomic or molec- 
ular beam, and by the study of muonic atoms (Sect. 8.7). 

The quadrupole shift must be taken into account in the analysis of hyperfine struc- 
ture in high-resolution atomic spectroscopy in crystals: there, information can be ob- 
tained in particular about the internal electric field gradients, and thus about the distri- 
bution of the electrons between the atoms or molecules. It is, to be sure, difficult to 
calculate the field gradients which are produced by the polarisation of inner, closed 
electronic shells through the fields from the outer electrons in open shells. The correc- 
tions to the electric field gradient required by this effect (socalled Sternheimer factors) 
can be only imprecisely calculated. More accurate results are obtained from modern 
ab initio computations (e.g. the LMTO-ASA method). 



Problems 

20.1 ^^^Bi has an excited state with the electronic configuration which is split 
into 6 hyperfine levels. The distances between each pair of levels are 0.236, 0.312, 
0.391, 0.471 and 0.551 cm”^ How large is the nuclear spin quantum number / and 
what is the hyperfine structure constant a? Sketch the position of the hyperfine levels 
relative to the unsplit line in units of a, 

20.2 Calculate the magnetic field generated at the nucleus by the 1 5, 25 or 35 electron 
of the hydrogen atom. How large is the energy difference between parallel and anti- 
parallel orientations of the proton and electron spins in these states? 

20.3 From the data on the hyperfine splitting of the D 2 line in the sodium spectrum. 
Fig. 20.8, calculate the hyperfine constant a and the field at the nucleus for the ^Pyi 
state. 

20.4 Assume that hydrogen in the ground state is in a magnetic field of 0.3 tesla. Cal- 
culate according to Sect. 20.5 and Fig. 20.14 the Zeeman energy of the electron, the 
hyperfine interaction energy and the Zeeman energy of the nucleus. 

20.5 ^^Mg atoms in the ground state (^Sq) are examined by the Rabi method. In a field 
^ 0 = 0.332 tesla, a resonance frequency v= 3.5 MHz is measured. ^^Mg nuclei have 
spin 1= 5/2. Calculate the gyromagnetic ratio y, the g factor and the largest compo- 
nent of the magnetic moment of the nucleus in the direction of the field (in nuclear 
magnetons). 

20.6 In an atomic beam experiment similar to the Stern-Gerlach experiment, a beam 
of ^^a (^Si/ 2 ) atoms is passed through a strong, inhomogeneous field B\ 
(jubBi> Eufs)- The beam is split into 8 parts. What is the nuclear spin quantum num- 
ber of ^^a? Into how many parts would the beam split in a weak inhomogeneous 
field? 
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21.1 Some Basic Concepts for the Laser 

The laser has become a light source without which modern spectroscopy could not exist 
(see also Chap. 22). In addition, the processes which lead to emission of laser light are a 
beautiful example of the application of basic knowledge about interactions between 
light and atoms. For both reasons we shall treat the laser in detail here. 

The word “laser” is an acronym for “/ight amplification by stimulated emission of 
/•adiation”. As we shall see below, however, a laser not only amplifies light, but in most 
cases acts as a very special kind of light source, which emits light with properties unique 
to lasers, namely: 

1) A particularly high degree of monochromaticity (temporal coherence). Line- 
widths Avof the order of ten hertz can be attained. This means, in the visible region, 
that the relative linewidth Av/vis < 10 “ The coherent wave train then has a length / 
of 300000 km (the coherence length is calculated from I = cAt = c/Av), in contrast to 
the light from conventional sources, which consists of a “spaghetti” of light wave trains 
about 1 m long. 

2) A very pronounced bunching of the light, which is practically limited only by dif- 
fraction effects at the exit window of the laser (spatial coherence). 

3) A high radiation intensity, which can be as high as 10*^- 10^^ W in pulsed 
operation. Together with 1) and 2), this results in a very high photon flux density in a 
very narrow spectral range. 

4) The possibility of generating ultra-short light impulses (down to lO"’"* s) of high 
intensity. 



Laser 

Pump 




Fig. 21.1. Elements of a laser 



The action of a laser can best be understood by considering its elements (Fig. 21.1). 
It consists, in principle, of a piece of “laser-active” material, which can be a solid, such 
as a ruby (i.e. a crystal of AI2O3 doped with Cr ions). The chromium ions, which are 
responsible for the red colour, undergo the optical transitions which make the laser 
process possible. Other examples are glasses or garnets which contain neodymium as 
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Fig. 21.2. Standing wave between the mirrors of the laser 
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Fig. 21.4. Typical pumping schemes. Above, atom with two levels. 
Middle, atom with three levels: left, the upper (optical) transition is 
laser active; right, the lower (optical) transition is laser active. 
Below, atom with 4 levels. The middle (optical) transition is laser 
active 



the active lasing atoms. Laser processes can also take place in gases or in certain dye 
solutions. Dye lasers are especially important for practical applications because they 
can be continuously tuned. The laser is bounded at two ends by mirrors. If we think of 
the light as a wave, we recognize that standing waves can only form between two 
mirrors if a multiple of one half the wavelength exactly fits between them (Fig. 21.2). 
On the other hand, if we consider the photon nature of light, then the photons which 
move in the axial direction are reflected back and forth between the mirrors, while 
those moving in any other direction quickly leave the laser (Fig. 21.3). 

Finally, the laser must be “pumped” by addition of energy from outside. Different 
pumping methods can be used, depending on the atoms. In the 3-level scheme shown in 
Fig. 21.4, the laser atoms are excited to state 2 by optical pumping, i.e. by irradiation 
with light. From there, the atoms can fall to state 1 by emission of a light quantum. The 
same figure shows other simple possibilities for pumping. The helium-neon laser de- 
monstrates a particularly interesting method of bringing about the occupation of excit- 
ed states. The pumping process in this case is explained in the captions to Figs. 21 .5 and 
21.6. Figure 21.7 shows an example of a construction plan for a helium-neon laser. 

The basic idea of a laser is as follows: We imagine that a considerable number of 
atoms have been brought into the excited state by optical pumping. Then photons can 
be spontaneously emitted. If the photon interacts with another excited atom, it can 
induce this atom to emit another photon. Through repetition of this process, an 
avalanche of photons builds up. The process cannot go on ad infinitum, since more and 
more of the atoms go into the ground state. If energy is constantly supplied by pump- 
ing, an equilibrium is reached. We must now examine the requirements for excitation 
conditions and radiation properties which must be met in order for the atom to be laser 
active. For this activity, the spontaneous emission of light, called fluorescence, must be 
replaced by induced, collective emission, which results in the typical characteristics of 
laser light. 
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Fig. 21.5. Excitation and recombination processes in the helium- 
neon laser (schematic). The symbols Is, 2s,... used in the neon 
spectrum on the right are arbitrary technical terms (Paschen 
symbols) explained in the legend of Fig. 21 .6. The helium atoms are 
excited to the or state by collisions with electrons in a gas- 
discharge tube. Because the excitation energies correspond to the 
“25” and “35” states of neon, this energy can be transferred from 
helium to neon by atomic collisions. The electron in the 35 or 25 
state in the neon then goes into the 3/? or 2/? state, and laser light 
emission may be associated with this process {solid arrows). The 
electron then drops back to the 1 5 state by emission of another light 
quantum. Finally the neon atom loses the remainder of the energy 
and drops back to the ground state by collisions with other atoms. 
In reality, the term scheme of neon is much more complicated than 
shown here (Fig. 21 .6). Note that the energy of the ground state was 
chosen as the zero point of energy here 



[10'’ cm-’] 




Fig. 21.6. Splitting and transition scheme for the energy levels of 
helium {left ordinate) and neon {centre). The ground state of neon, 
which is not shown here, has the electron configuration 
\s^2s^2p^. In the states shown, one of the six 2p electrons is 
excited. If we give only the configuration of the six outermost elec- 
trons, then the abbreviations I 5 , 2s, and 35 stand for 2p^3s, 
2p^ 4s and 2p^ 5 s, respectively. Each of these states consists of 4 
terms. The abbreviations 2p and 3p stand for 2p^ 3p and 2p^ 4p, 
each of which has 10 terms 




Fig. 21.7. Example of a gas laser. The most important part is the 
discharge tube which contains the helium-neon mixture. The 
surfaces at the end are placed to form an angle less than the 
Brewster angle. The mirrors, which are curved to give the light 
modes more stability, are located outside the tube. The rest of the 
parts serve as supports 



In the following, we shall discuss the important case in which only a single type of 
photon is generated by the laser process, namely photons which move in the axial 
direction and exactly fit between the mirrors as a standing wave. It is also assumed that 
the basic frequency of the light is in resonance with a transition frequency of the atoms. 
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A laser which has these properties is called a “single-mode” laser. In the next section we 
shall develop the theory of this laser in terms of the photon model, and then in Sect. 
21.3, in terms of the wave model. 



21.2 Rate Equations and Lasing Conditions 

First we consider the rate equations of the laser. It is most useful here to think of the 
light as photons. As we have already observed, there are different types of photons, 
e.g. those moving parallel or obliquely to the axis of the laser. We would expect the 
axially oriented photons to be most likely to contribute to the laser process, since they 
remain between the mirrors for the longest time and thus have the greatest chance of 
inducing emission. We let « be the number of photons under consideration and observe 
how this number changes with time. The number of photons increases as a result of the 
induced emission. According to Einstein, the rate of increase is proportional to the 
number of atoms in the excited state N 2 and to the number n itself (see Sect. 5.2.3). It is 
also proportional to a transition probability per second, W. Of course, atoms in the 
lower state of the optical transition can absorb photons, thus removing them from cir- 
culation. The photon number is reduced by WN^n. The rate of the spontaneous 
emission is independent of the number of photons present, but it is proportional to the 
number of excited atoms: WN 2 . Finally, the photons can escape through the mirror or 
be lost by scattering. The rate of loss is proportional to the number n of the photons. If 
we assume that the lifetime of a photon in the laser is to, the loss rate is - n/t^. We thus 
obtain the scheme: 



Process 


Rate 


Stimulated emission 


WN2U 


Absorption 


- WN^n 


Spontaneous emission 


WN 2 


Losses 


-n/tQ 



Summarising these expressions, we obtain the rate equation for the number of photons: 

— =W{N2-Ni)n+WN2- — . ( 21 . 2 ) 

dt to 

Using his idea about photons, Einstein showed in the derivation of the Planck 
radiation formula that W is given by 



W = 



1 

VD{v)Avx 



(21.3) 



Here V is to be interpreted as the volume of the lasing material, zl v is the linewidth of 
the electron transition between levels 2 and 1 in the atom, and t is the lifetime in level 2. 
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D(y)Av is the number of standing waves in a unit volume in the frequency range 
V. . . v + Zl V. D{v) is given explicitly by 



D(v) = Sn- 






(21.4) 



where v is the frequency of the atomic transition. With (21 .3), it is possible to derive the 
laser conditions. As we have mentioned, a photon avalanche must be generated. This 
means that only those terms which are proportional to n in (21.2) are of interest. Fur- 
thermore, a more exact mathematical analysis shows that spontaneous emission, repre- 
sented by the term WNi, is not correlated with the actual laser light, and only gives rise 
to noise. 

To derive the laser condition, we therefore leave out this term on the right side of 
(21 .2). We now obtain the laser condition by demanding that the rate of photon genera- 
tion is greater than zero: 



^ = W{N2-Ni)n- — > 0 . 
dt ^0 



(21.5) 



Since the photon number is assumed not to be equal to zero, (21.5) leads directly to a 
relationship for the necessary “inversion” per unit volume. 



Ni-Nx inv^AvT 
V c^to 



( 21 . 6 ) 



Equation (21.6) is the laser condition. In order to obtain laser activity, we must there- 
fore bring enough atoms into the excited state by optical pumping, so that the differ- 
ence in the occupation densities (A 2 -A^i)/F satisfies the condition (21.6). The smaller 
the right side of (21 .6), the more easily laser activity can be obtained. The right side sets 
conditions on the atoms to be used. In order to get by with the smallest possible 
pumping power, the atoms must have as narrow a linewidth as possible. Furthermore, 
the laser condition becomes more and more difficult to satisfy as the laser frequency v 
increases. In addition, the lifetime to of the photons in the laser must be made as large 
as possible by using the best available mirrors, to can be estimated as follows: \/to can 
be taken as the rate per second at which photons leave the laser. This rate is naturally 
proportional to the velocity of light. Because the time of flight between the mirrors is 
proportional to the distance between them, the probability of loss is inversely propor- 
tional to the length L. Finally, the probability of exit is inversely related to the reflec- 
tivity R of the mirror. This yields the factor (1-i?). In this way, we arrive at the 
formula 



to L 



(21.7) 



for the lifetime of the photons. 

The overall dynamics of the laser radiation can only be described when, in addition 
to (21.2), we have similar equations describing the occupation numbers for the atomic 
states. These are based on the pumping scheme shown in the middle section of 
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Fig. 21.4. The rate of change of the occupation number N 2 is given by the following 
equation, which we shall justify intuitively: 

/"I \T 

1 = - \vN2n + WNi n + W2 qNq- W12N2 • (21 .8) 

dt ' -V- ' ^ ' 

coherent interaction pumping and 

recombination 

The first two terms on the right side describe the changes in the occupation number Ni 
due to stimulated emission and absorption. The third term on the right side shows how 
this occupation number is increased by pumping from the ground state (level 0), and 
the last term represents changes in the occupation number caused by competing pro- 
cesses which do not contribute to the laser process, such as non-radiative recombina- 
tion from state 2 to state 1. (By the way, the expression “W 20 ” is read “Wtwo zero”» 

^twenty •) 

One is now easily convinced that the rate equations for the other two levels have the 
form 



= WN2n - WNi n + w^N2 - WniM 
dt 



(21.9) 



and 



^ - W20^0+ WoiM , (21 .10) 

dt 



assuming that we ignore direct recombination from state 2 to state 0. We can obtain an 
essential clue to the solutions of these equations by considering a special case. We 
assume that the transition from state 1 to state 0 is extremely rapid, so that level 1 is 
practically unoccupied, and we can set = 0. Of the equations (21 .8 - 10), we are then 
interested only in (21.8), which we write in the form 



= - WN 2 ti + Wi 2 (A^ 2 - N 2 ) with N 2 = — No . 
dt Wj2 



( 21 . 11 ) 



Here N 2 is the value of N 2 which would be obtained through pumping and relaxation 
processes alone, i.e. if n = 0. The expression for the photons also simplifies to 



dn n 

= fVN2n - — . 

dt to 



( 21 . 12 ) 



We now convince ourselves that a laser can attain a steady state, i.e. that it can emit 
light continuously. This requires that we set 



dt dt 

From (21.12), we obtain the relationship 

Wto 



(21.13) 



(21.14) 
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This says that a definite occupation number 7V2,th (th means threshold) is established 
during the laser process, and that it remains constant, even if we pump in more energy. 
The pump energy must therefore be converted primarily into photon energy, as we 
actually find if we solve (21.11) for n: 



n = 



^12 

w 



N, 



N- 



- 1 



2,th 



(21.15) 




Fig. 21.8. The emission behaviour of a 
single-mode laser. Abscissa: pumping power 
input. Ordinate: emission power (in a single 
mode). Below the threshold, i.e. the critical 
pumping power, the light consists of spon- 
taneously generated wave trains (“noise”). If 
this is ignored, the emitted radiation is equal 
p to zero. Laser activity begins at the threshold 



If the average number Ni is increased by increasing the pump rate W 20 , then accord- 
ing to (21.15) the number of photons increases. Since the photon number cannot be 
negative, the laser process does not begin until a critical value of the inversion or occu- 
pation number N 2 ,th is reached. This accounts for the curve given in Fig. 21.8. Below a 
certain pumping power, or inversion, laser light is not emitted. Above this power, the 
number of photons emitted increases linearly. In actual data curves, there is emission 
below the threshold, but this is due to the fact that the atoms can emit light spontane- 
ously, and this light is “noise” as far as the laser is concerned. 



21.3 Amplitude and Phase of Laser Light 

In the preceding section we treated the single-mode laser in terms of the photon model. 
This made it possible to derive the lasing conditions and to determine the number of 
photons present in the laser. We know, however, that light is characterised not only by 
its intensity, but by its phase relationships, which account for such phenomena as inter- 
ference. It has been shown experimentally that there are limits to the ability of wave 
trains to interfere with each other. Let us consider a finite wave train, which we divide 
into two by means of a half-silvered mirror. We allow the two beams to traverse dif- 
ferent paths, and then allow them to interfere. Obviously this is only possible if the dif- 
ference in the lengths of the paths is less than the length of the wave train. Let us now 
consider the light from an ordinary lamp, in which each atom emits its wave train inde- 
pendently of the others. Since the emission time (= lifetime of the excited state) is 
finite, the wave train has only a finite length. The light field generated by the lamp thus 
consists of individual, uncorrelated wave trains. 
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The “coherence length” of the light is the average length of a wave train. In the fol- 
lowing, we shall examine the question of whether the light from a laser fits this descrip- 
tion, only with much longer wave trains, or if it has basically different properties. To 
answer this question we construct the following laser model, which can be rigorously 
grounded. 

Let us consider a light wave between the two mirrors. It has the form of a standing 
wave, and the light field strength F can be written 



F = £'tot(0 sin/:x 



where 

, nm 

k = . 

L 



(21.16) 

(21.17) 



Here L is the distance between the mirrors and m is an integer. In a purely harmonic 
oscillation, would have the form 



Eioiit) = lE^zoscot , (21.18) 

where oj is the atomic transition frequency. Actually, however, we must expect that 
phase shifts arise in the course of time, and that the amplitude Eq also is not constant. 
For example, we know that in the case of spontaneous emission from a single atom, Eq 
decays exponentially. For this reason, instead of (21.18) we write 

= (21.19) 

and thus obtain for (21.16) the general trial solution 

F= [^(Oe”'^^+c.c.] sinA:x. (21.20) 

An equation for the complex amplitude E{t) is derived in laser theory. We attempt 
to arrive at this equation by an intuitive process as follows. The light field amplitude E 
is increased by the process of induced emission, with the amplification being propor- 
tional to the light field amplitude E and to the number of excited atoms N 2 . The pro- 
portionality factor is called g. The light field amplitude is also subject to a symmetrical 
decrease, due to absorption, which occurs at the rate -gN^E, 

Furthermore, the light can leave the laser through the mirror or be scattered in some 
way. The result is a loss rate for the light field amplitude expressed by - kE, Finally, 
the light field strength is also subject to constant change due to spontaneous emission 
events. This is expressed as the so-called “fluctuating forces” f{t), which reflect the 
statistically fluctuating emission events. We thus obtain the equation 

dF 

— = 9(N2-N,)E- KE+f(t ) . (21.21) 

dt 

As we saw in the preceding section, the inversion (A^ 2 ~^i) is changed by the laser 
process. On the one hand, {N 2 ~Nx) is brought to a value of (A^ 2 “M)pumped by the 
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processes of pumping and relaxation - this is called the unsaturated inversion. Atoms 
are constantly being removed from the upper level by the laser process, so that the 
actual inversion {Ni-N^) is less than the unsaturated inversion. If the laser intensity is 
not too high, one can assume that this reduction is proportional to the intensity \E^ of 
the light, so that 



N2-7V, = (N 2 -M)pu^ped- const |£1L (21.22) 

Equation (21 .22) is called the saturated inversion. If we substitute (21 .22) in (21 .21), we 
obtain the fundamental laser equation 

dF 

— = (G-K)E-C\E\^E+f(t). (21.23) 

dt 



Here G = S^(A 2 ~^i)pumped is a gain factor, and C = gr- const. 

To arrive at an intuitive understanding of the meaning of (21.23), we can take 
refuge in mechanics and think of £' as a coordinate ^ of a particle. If we introduce an 
acceleration term m^q on the left side of (21.23), we obtain 

rriQq + q = K{q)+f , (21.24) 



where we can think of m as being so small that it can be ignored. Equation (21.24), 
however, is the equation for the damped oscillation of a particle subject to the forces K 
and /. We can calculate K from a potential V: K = -dV/dq. The potential field is 
shown in Fig. 21.9. For G < k, i.e. for low pumping power, the upper curve applies. 
Our fictitious particle is repeatedly pushed up the potential curve by the fluctuating 
force /, and then falls back to the equilibrium position q = 0 under the influence of the 
force K. If we identify q with E, this means that each push represents the beginning of 
an emission event in the atom. After the event, the field strength again decays to zero. 
Since the pushes are directed to the left and right with equal frequency, the average 
position of the particle is equal to zero, or in other words, the average field strength E is 
equal to zero. 

Now let us consider the case in which G>K.ln this case the lower curve of Fig. 21 .9 
applies. It is obvious that the original position ^ = 0 has become unstable and is to be 
replaced by a different q. On the other hand, however, the pushes exerted on our ficti- 
tious particle drive it around among these new equilibrium positions, so that q has the 
form q = r^^- g{t). Now comes an important point. We spoke before of a real particle 
coordinate. In reality, however, the field amplitude in (21.23) is complex. 

This means that in order to complete the picture, we must think of the potential 
field being rotated about the V axis (Fig. 21.10). The fictitious particle in this picture is 
rolling in a valley, in which it is pushed either in a radial or a tangential direction by the 
fluctuating forces. Because there is no restoring force in the tangential direction, the 
particle carries out a sort of diffusion motion. If we apply this to laser light, we obtain 
the following picture: 

The complex amplitude can be broken down according to the rule 

i0(O 



E = [A*o+^(0]e 



(21.25) 
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Fig. 21.9a, b. The “effective potential” for the laser amplitude q = Fig. 21.10. The “effective potential” for the complex 

E. a) Potential for pumping power below the threshold. laser amplitude £■ = + iqrj above the threshold 

b) The same above the threshold 



where is the new stable amplitude, q is the amplitude fluctuation, and the factor 
exp[i0(f)], which describes the tangential diffusional motion mentioned above, is the 
phase diffusion. As is shown in laser theory, the average value oi decreases with in- 
creasing laser intensity I according to <^^> ~ 1/7. The equation (21.25) represents an 
infinitely long wave train, which merely undergoes small fluctuations in phase and 
amplitude. Phase diffusion is responsible for the finite linewidth, which also decreases 
with 1/7. We recognise from this that the properties of laser light are fundamentally 
different from those of light from normal lamps. Laser light has a stable amplitude and 
its linewidths are very sharp. The theoretical values naturally depend on the laser 
materials, but for typical materials they are of the order of 1 Hz. The linewidths 
observed with precision lasers are about 10 Hz. 



Problems 



21.1 Calculate W for a ruby laser with the following parameters: 

F=62.8cm^ 
v = 4.32- 10’^ Hz 
z1v = 2.49-10"Hz 
c = 2.9979 • 10‘\m/s 
T= 3.0 ms 

21.2 Calculate t^ for the following cases: 

Resonator length 7- = l,10orl00 cm; 

Reflection coefficient R = 99, 90 or 10%. 
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21.3 Using the rate equation (21.2), calculate the exponential rise time for the 
parameters calculated in Problems 21.1 and 21.2. The population difference (N 2 ~^i) 
should be taken to be constant with the value 10^"^. 

21.4 Using the laser conditions, calculate the critical inversion density for ruby. The 
following data apply: 

F=62.8cm^ 
v = 4.32- 10‘'*Hz 
Zlv = 2.49-10'^ Hz 
T= 3.0 ms 

c' = 1.70- 10*° cm (velocity of light in the medium) 

R = 99% 




22. Modern Methods of Optical Spectroscopy 



22.1 Classical Methods 

We have come to know optical spectroscopy in the preceding chapters as the most 
important method for the investigation of the electronic shells of atoms and for under- 
standing atomic structure. We have seen that in many cases, extremely small splittings 
or shifts of the spectral lines must be determined with a high spectral resolution. 
Progress in our knowledge of atomic structure and of the fundamentals of quantum 
mechanics has only been possible through a continuous process of improvements in 
experimental techniques. 

In this chapter, we shall first treat the question of how a high spectral resolving 
power can be attained. If J A is the wavelength difference which two closely spaced 
spectral lines may have and still be recognised as two lines - i.e. resolved - then the 
quotient A/ZIA is called the spectral resolving power of the apparatus. (We note that a 
large value of this quotient implies a “high” resolving power in the above sense.) 

With prism and grating spectrographs, which were developed nearly to perfection in 
the first quarter of this century, it is difficult to obtain a resolving power significantly 
above a few x 10^. Diffraction limits the resolving power of a grating spectrograph to 
the product Nm, where N is the total number of grating rulings and m is the spectral 
order. Both numbers cannot be increased without limit while maintaining a measurable 
light intensity. Nevertheless, an extremely high resolving power may be reached by 
observing in a high order with a correspondingly shaped (“blazed”) grating, to be sure 
in a limited range of wavelengths. If resolving powers of 10^ and above are required, an 
interferometer must be used. The most important type is the parallel-plate interfero- 
meter of Fabry and Perot (Fig. 22.1). 

This interferometer consists of two half-silvered glass plates which are set up 
parallel to each other at a distance of several cm. Through multiple reflections, inter- 




chromatised Half-silvered Image 

light glass plates plane 



Fig. 22.1. Fabry-Perot interferometer. The interference fringes at a 
given angle which are produced between the two plane-parallel glass 
plates yield a system of concentric rings in the image plane. The usable 
range of wavelengths is very small; for this reason, the light must be 
pre-monochromatised with a prism or a grating monochromator 
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ferences of high order m occur. If the spacing of the plates is 2.5 cm, then the path length 
difference between neighbouring rays is equal to 5 cm. For light of wavelength 500 nm, 
this corresponds to an order m = 10^ since m = (5 • 10"^m)/(5 • 10"^m). The number 
of rays N which can interfere is limited by the reflectivity of the mirrors and is of 
order 10. The resolving power is thus about 10^ according to the formula k/A k = m- N, 

With such spectrographs, the line shapes or the splitting behaviour in a magnetic 
field are investigated for individual spectral lines. For this reason, usually a simple 
monochromator to pre-select the light is used in front of the Fabry-Perot interfero- 
meter. In the image plane of the interferometer, a system of concentric rings is 
observed. For monochromatic light, these are the neighbouring interference orders, 
e.g. m = 1000000, 1000001, 1000002, etc. The intensity distribution in the interference 
pattern may be recorded photographically or photoelectrically. The wavelength range 
may be increased somewhat by changing the spacing of the plates or the gas pressure 
between the plates. 

In the past 10 years, completely new methods of ultra-high-resolution optical spec- 
troscopy have become possible as a result of developments in laser technology. In the 
following sections, we shall treat these methods in more detail. 



22.2 Quantum Beats 



In this section, we shall treat a modern spectroscopic method which has come to be 
known by the name “quantum beats”. This method permits the resolution of closely 
spaced neighbouring levels. To understand the principle, let us imagine an atom with 
three levels, which we shall denote by the indices 0, 1, and 2 (Fig. 22.2). The energy 
levels 1 and 2 are supposed to be closely spaced, with a frequency spacing denoted by 
oj \2 = (E 2 -Ei)/h. The atom is at first taken to be in the ground state 0 and is then 
excited by means of a brief light pulse. The pulse length r is chosen so that it fulfils the 
relation l/cui 2 . The centre frequency (“carrier frequency”) coq of the pulse corre- 
sponds to the transition frequency from level 0 to one of the levels 1 or 2. If we suppose 
the pulse to be expanded in sine waves, these have frequencies for the most part in the 
range - o)\ 2 + ojq to + cui 2 + ojq. This light pulse can cause an electron to undergo a 
transition from the ground state into one of the two states 1 or 2, or, more generally, 
into a superposition of the two states. A superposition of two wavefunctions may be 
interpreted, as we saw in Sect. 16.1.4, as an oscillation, for example of a dipole, with 
the frequency cui 2 * If the electron now radiates light, the exponential decay of the 
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Fig. 22.2. Quantum beats. Left: Sche- 
matic energy level diagram. Right: Ex- 
perimental set-up. The atom is excited by 
an incoming light pulse. The emitted 
radiation is measured in another direc- 
tion with a photomultiplier. Typical ex- 
perimental results are shown in Fig. 22.3 
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radiated intensity from spontaneous emission is modulated at the above oscillation fre- 
quency u)\ 2 - Applying the statistical interpretation of quantum mechanics, this pheno- 
menon is often explained as follows: the excited electron, when it happens to be in state 
1, can emit a photon with the frequency cuio = (£'i-£'o)/^; in state 2, it can emit a 
photon of frequency To these two light quanta, there corresponds a beat frequen- 
cy 0)12 = <^20“ thus we arrive at the name “quantum beats”. 

We now give a more exact quantum mechanical treatment. Using quantum theory, 
we must take as the wavefunction of the excited electron at time t = 0 

y/(r,0) = ax0xir) + cci0iir ) , ( 22 . 1 ) 

where 0/ is the wavefunction of the electron in the level / = 1 or 2. The a’s are coef- 
ficients which are determined by the initial excitation process. In the time that follows, 
the electron will undergo a transition to the ground state, accompanied by the emission 
of a photon; in the process, the occupation probability in the levels 1 and 2 decreases 
exponentially with a decay constant IF, while the ground state becomes populated. The 
corresponding wavefunction is denoted by 0 q. This implies the state in which the elec- 
tron is in the ground level and, additionally, a photon is present. The total wavefunc- 
tion therefore takes the form 

xf/ir, t) = a\ exp( - \Ex t/h - Ft) 0i (r) -f ai exp( - xEit/h - Ft) 02 (r) + a^it) 0o(r) . 

( 22 . 2 ) 

The coefficient ao(0> which is especially interesting for the present application, can 
be quantum mechanically determined. Unfortunately, we cannot, for reasons of space, 
give the Wigner-Weisskopf theory, which is applicable to this determination, here. We 
therefore simply show the result, according to which the coefficient consists of the 
two parts 

ao(0 = Ci(/) + C2(/). (22.3) 

in which the coefficients Cj have, in principle, the form 

Cj(t) = ajC ■ . (22.4) 

Thus the coefficient Cj depends on the initial coefficient aj, as well as on the polarisa- 
tion e of the emitted photon and on the dipole moment Ooj which connects the ground 
state with the excited state y; a is a proportionality factor which need not interest us 
further here. As may be immediately seen, the coefficient Cj vanishes at ^ = 0, but 
reaches a maximum value at later times. The probability of finding the electron in the 
ground state grows continuously until it reaches 1 . The probability of finding a photon 
present is given by |aoP* If we use the sum (22.3) and the explicit form (22.4), we can 
see that \ has the form 

|aoP= e“^^[yl+Bcosa;i2(/+0)] . (22.5) 

Here, A and B are time-independent constants. Through the superposition of the 
emission from the states 1 and 2, an oscillating contribution with the frequency 

= (E 2 -E\)/h is produced. If (22.5) is plotted as a function of time, the modulated 
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decay curve shown in Fig. 22.3 is obtained. The frequency coi 2 , and thus the energy dif- 
ference of the two excited levels, can be determined from experiment, which is used to 
obtain precision measurements of this energy difference. 



22.3 Doppler-free Saturation Spectroscopy 

As we have already seen in Sect. 16.2, the spectral lines from gas atoms are broadened 
because of the Doppler effect. Since this Doppler broadening is, in general, much 
greater than the natural linewidth, it represents a considerable obstacle to the measure- 
ment of narrow spectral lines. With the help of lasers, it has been found possible to 
avoid Doppler broadening. To understand how, we recall the chapter on laser theory, 
considering in particular the gas laser. There we saw that because of laser action, the 
originally inverted populations of the atoms (i.e. N 2 -N^> 0) become less inverted. 

Now imagine that we can distinguish the individual atoms in a gas laser according to 
their velocities in the axial direction of motion. Each group of atoms with velocity v has 
a particular occupation number Aj ^ or A^ 2 ,d the lower or in the upper state, respec- 
tively. If we multiply the occupation number A^ 2 ,y» which is supposed to be the same for 
all groups of atoms v, by the Maxwell velocity distribution (16.73), we obtain a profile 
like that shown in Fig. 22.4. If we now imagine that the laser light is produced with a 
particular frequency co, it will only be able to interact with those groups of atoms v for 
which the Doppler shift is 

\(o- o)o-\- coov/c\ < y , ( 22 . 6 ) 

where y represents the order of magnitude of the natural linewidth of the atomic transi- 
tion, and coq is the transition frequency of the atoms at rest. Due to this interaction, the 
inversion of this particular group v of atoms will be lowered, i.e. the number of atoms 
in the upper level is reduced. In this manner, the originally unperturbed population 
density has a “hole burned” in it (Fig. 22.5). 

Because of the laser tube, we are dealing with a standing wave, which may be 
thought of as being composed of two travelling waves. The wave travelling in the one 
direction is then in resonance with atoms having a velocity v = c(l- co/coq), the other 
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Fig. 22.4. The number of excited atoms N 2 with velocity 
V is represented as a function of v by a Gaussian curve, 
according to the Maxwell distribution 




Fig. 22.5. This figure shows the effect of a standing laser 
wave on the distribution of excited atoms. If we imagine 
the standing wave to be composed of two travelling 
waves, each of these components interacts with a group 
of atoms whose Doppler frequency just coincides with 
the laser-wave frequency. Therefore, in general, two 
holes are burned in the distribution. If these two holes 
overlap, the notch in the distribution is especially deep 



is in resonance with atoms having a velocity v = -c(\- co/ coo), where v is the velocity 
component along the axis of the laser tube. In this way, two holes are burned in the 
population inversion distribution. If the laser frequency a> exactly agrees with the fre- 
quency of the atomic transition coq, the two holes coincide and a particularly deep 
notch is cut out of the distribution. This is clearly the case for those atoms whose veloc- 
ity component along the laser axis is zero, or, more precisely, for which \ ojQv/c\<y. 
The intensity emitted by the laser naturally depends upon the population inversion; 
thus, the intensity goes through a minimum when the laser light is exactly in resonance 
with the transition frequency of the atoms at rest, v = 0. This effect is known as the 
“Lamb dip”. It was, by the way, independently and simultaneously predicted theore- 
tically by Haken and Sauermann. Using the Lamb dip, the position of a spectral line 
can clearly be much more precisely determined, namely to the order of magnitude of 
the natural linewidth. To be sure, this method is limited to media which are themselves 
laser active. Furthermore, in the Lamb dip an additional source of broadening, called 
power broadening, plays a role; we shall not discuss it in detail here. 

A decisive improvement was suggested by Letokhov: the substance which is to be 
investigated is brought, in gas form, between one of the mirrors and the laser-active 
medium. When the gas atoms, which once again have a Maxwell velocity distribution, 
are illuminated by the intense laser beam, those which fulfil condition (22.6) absorb 
strongly. Thereby, an appreciable fraction of these atoms is pumped into the excited 
state(s), which decreases the population in the ground state and thus reduces the ab- 
sorption. We have burned a hole in the absorption line. If we now suppose that the 
laser light and the central frequency for the atomic transition are in resonance, we find 
the picture shown in Fig. 22.6. Above the Doppler broadened line, a very narrow line is 
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Fig. 22.6. Burning a hole in the absorption line produces a peak in 
the emitted radiation, which is shown here as a function of the 
frequency of the laser field. The width of the peak corresponds to 
the natural linewidth 



produced, with a width of the order of the natural linewidth of the material inves- 
tigated. In general, the frequencies of the laser light and the absorbing material will not 
be exactly the same. But even then, there is a sharp line of enhanced total emission to be 
observed above the Doppler broadened line. 

Figure 22.7 is a diagram of an experimental arrangement. With such methods it has 
been possible to observe Lamb shifts, such as the one mentioned in Sect. 12.11, 
optically (refer to the spectrum shown in Fig. 12.24). There are a number of variations 
of the method discussed here, but we must refer the reader to the literature on them (see 
the list of references for this chapter at the end of the book). 



Modulation 

Tunable Beam amplifier 




Fig. 22.7. Diagram of an apparatus for saturation spec- 
troscopy. The light from a tunable dye laser is divided 
into an intense saturation beam and a weaker test beam. 
These pass in opposite directions through a vessel filled 
with hydrogen atoms (according to Hdnsch, Schawlow 
and Series 



22.4 Doppler-free Two-Photon Absorption 

In an intense field of laser light, it is possible that an atom will absorb two photons. In 
this case the energy difference AE between the two levels is exactly twice the photon 
energy, i.e. 



AE = 2hv = 2hco, (22 J) 

An atom can also absorb photons with different frequencies oji and a> 2 , where the con- 
servation of energy requires that 



AE = ho)\ + hoj2- (22.8) 

In addition, we must take into account that the selection rules for two-photon absorp- 
tion are different from those governing one-photon absorption. For example, transi- 
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Fig. 22.8. a) Two-photon absorption scheme. We think 
of the standing laser wave as being broken down again 
into two waves moving in opposite directions. The part 
moving to the right interacts with atoms at a Doppler- 
shifted transition frequency a>-kv, where k is the 
wavenumber. The wave moving to the left, however, in- 
teracts with the same atom at a Doppler frequency of 
co + kv. When the two waves add, the velocity com- 
ponents cancel out. b) The shape of the resonance line 
in two-photon absorption. A is the Doppler width, 
and 2r the total width of the resonance line 



tions from one s state to another are now allowed. This can be justified, within the 
framework of perturbation theory, but we cannot undertake it here. 

We now imagine that we pass intense laser light, which forms a standing wave in a 
resonator, through a gas of the atoms being studied. If we pick out an atom with the 
velocity v in the axial direction, the frequency of a photon (oj) moving in one direction 
will appear, in the moving frame of reference of our atom, to be o>i = o»(l + v/c). The 
frequency of a photon moving in the opposite direction will be, in the atom’s frame of 
reference, a >2 = cu(l - v/c). Obviously the atom can absorb both photons, obeying the 
law of conservation of energy, if 

AE = hu>i + ha >2 = fico(i + v/c) + fico(l - v/c) = Ihco . (22.9) 

Since the velocity v of the chosen atom cancels out of this equation, all the atoms can 
carry out this two-photon process, so that a very intense absorption line with a line- 
width equal to the natural linewidth can form (Fig. 22.8). In this way the Doppler 
broadening can be cancelled out. Figure 22.9 is a diagram of an experimental arrange- 
ment. 

In the last chapter, especially, it was clear that atomic physics is still an area in 
which much research is being done. The new methods of high-resolution spectroscopy 
have been made possible by the laser, and they offer the possibility of examining 
atomic structure more closely and of carrying out entirely new kinds of experiments. 
The development is quite certainly still in progress. 



Photocell 




I Mirror 

Tunable dye laser Hydrogen atoms 



Fig. 22.9. Diagram of an experimental arrange- 
ment for two-photon spectroscopy. Two laser 
beams moving in opposite directions along the 
optical axis are passed through the gas being 
studied. By absorption of two photons (one 
from each of the oppositely propagating laser 
beams), an atom is raised to an excited state. Its 
recombination radiation is detected by the 
photocell, giving a measure of the strength of the 
absorption process 
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22.5 Level-Crossing Spectroscopy and the Hanle Effect 

The coherent excitation of two different excitation states of an atom, which was 
introduced in the section on quantum beats, is also the basis of another spectroscopic 
technique. The latter, however, can be done with a conventional light source or 
electron bombardment, and was first used in 1959 by Franken. It is called level-crossing 
spectroscopy. 

If atomic terms are split in an external magnetic field Bq (Chap. 13), then in the 
range of weak or medium fields, levels can cross. At certain field strengths 
therefore, two different Zeeman terms coincide energetically. Such crossing occurs 
when there is already fine structure splitting in the absence of an external field Bq, or 
when there are hyperfine multiplets with the same / value (Fig. 22.10). Let us consider 
two resonance transitions from the term splitting in Fig. 22.10. Both end at the 
common ground state, and both are excited simultaneously by the Doppler broadened 
line of the light source. Each excited state returns to the ground state by emitting 
resonant light. The intensity and polarisation of the emitted light can be calculated 
from the selection rules given in Chaps. 12 and 13. The sum of the individual intensities 
is observed by the photodetector. If we assume, for example that each state by itself 
emits a linearly polarised wave with amplitude coscui^ or a 2 COSoj 2 t, then the total 
intensity of the observed resonance light when there is no level crossing is proportional 
to a\^-a\, on a time average. 

If the two levels cross, however, the transition frequencies are the same. Both levels 
are coherently excited, and the light waves emitted from them interfere. The observed 
intensity is then proportional to {a^ + a 2 )^. There is thus a change in the spatial distribu- 
tion of the emitted radiation, and in the direction of observation established by the ar- 
rangement of the photodetector there is generally a change in the observed total inten- 
sity whenever the external field takes on a value B^ at which crossing occurs between 
two levels. An experimental observation is reproduced at the bottom of Fig. 22.10. 

The crossing fields B^, and thus the dependence of the terms on the field strength 
Bq, can be deduced from such changes in intensity (Fig. 22.10, upper part). The meas- 
urement of these term changes is an important experimental aid in the determination of 
fine structure and hyperfine constants. 

If linearly polarised light is used for excitation, it is possible to excite coherently two 
terms with the same transition frequency if the selection rule for one, for example is 
dm = -hi, while the rule for the other is dm = - 1. Both are fulfilled, because linearly 
polarised light can be decomposed into a and a cr~ component. The m- values of the 
level crossings are indicated in Fig. 22.10. 

The significance of the method for spectroscopic analysis of atomic excited states 
results in particular from the high spectral resolution. The precision with which the B^ 
values can be measured, i.e. the “linewidth” of level-crossing spectroscopy, is not 
limited by the Doppler width of the spectral lines, since here the coherence of the exci- 
tation within an atom is utilised. Rather the linewidth is limited essentially by the life- 
time of the excited state and is, in practice, at least several MHz. One thus obtains the 
same high spectral resolution as with high-frequency spectroscopy. As an example, the 
fine structure splitting 2 ^P^/ 2 ~ 2 ^Pi /2 in the ^Li atom could be determined to be 
10052.76 MHz. 

In a similar manner, the Hanle effect, discovered in 1924, can be thought of as a 
zero-field level crossing. Terms which are not split without an applied magnetic field 
“cross” in zero field, when the field is varied from negative values through zero to 
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Fig. 22.10. Level-crossing spectroscopy and Hanle effect on the transition ^5j/2-^P3/2 in Cu, I = 3/2 (from 
Bucka et ah). In the upper portion of the figure, the energy level diagram is shown; the splitting of the excited 
state ^^ 3/2 is given in units of the hyperfine structure constant A. The crossing points which are accessible to 
level-crossing spectroscopy are denoted by their values. The strength of the applied field Bq is plotted 
along the abscissa. In the lower portion of the figure, experimental curves for the Hanle effect {left) and for 
the level crossings {right) are shown. For experimental reasons, the derivative of the intensity w.r.t. magnetic 
field dl/dBQ is plotted rather than the intensity itself. The plots therefore are dispersion curves and the level- 
crossing fields B^ correspond to the zero-crossings of the curve, i.e. the points where it crosses the abscissa. 
At the left, one can see the Hanle effect at Bq = 0; on the right, the 3 level crossings, corresponding to the 
crossing points (-2,0), (-2,0) and (-3, - 1) in the term diagram. To convert the linewidth in Gauss to the 
value in MHz, (13.7) is to be used 

positive values. In zero field, degenerate atoms may be coherently excited for the same 
reasons as in level-crossing spectroscopy, assuming validity of the same selection rules. 
The coherence of the excitation and the interference of the emitted resonance light are 




Light source 



Fig. 22.11. Level-crossing spectroscopy and Hanle effect. Left, a schematic of the experimental setup. The 
atoms in the gas cell are excited by polarised light from a conventional light source and the resonance light 
(with or without polarisation analyser) is observed as a function of the magnetic field Bq. At level-crossing 
fields B^, the intensity and the polarisation of the resonance light detected by the photomultiplier change, in 
general. This is indicated on the right 
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removed when the degeneracy of the terms is lifted through the application of a 
magnetic field. Figure 22.10 also shows an example of a measurement of the Hanle 
effect. It is an important experimental method for determining the lifetimes of excited 
atomic states. A schematic of the experimental arrangement for observation of the 
Hanle effect and of level crossings is shown in Fig. 22.11. 



22.6 Laser Cooling of Atoms 



Photons can be scattered by atoms and, in the process, transfer momentum to them. 
In this scattering process, first an incoming light quantum is absorbed, and then a new 
light quantum is emitted in a direction which is in general not the same as that of the 
absorbed photon. If the energy of the photons corresponds to a transition frequency 
of the atom, a real absorption process occurs; otherwise, a so-called “virtual” transi- 
tion takes place. If a beam of atoms collides with a photon beam, the atoms are 
decelerated, since the photons initially have a preferred direction, while the re-emitted 
light quanta can be radiated in all directions. This braking effect is, to be sure, extreme- 
ly small for a single photon: the velocity of the atom is reduced by an amount on the 
order of one centimeter per second, while the atoms in a gas at room temperature move 
at about 1000 meters per second. This ‘scattering force’ was in fact discovered as early 
as 1933 by Otto R, Frisch, who used it to deflect a beam of sodium atoms. To this end, 
he first produced sodium vapor by heating the metal in a vessel, and then allowed the 
atoms in the vapor to pass out through a series of slits, forming an atomic beam. This 
beam was irradiated from the side by photons from a sodium vapor lamp. In this ex- 
periment, Frisch was indeed able to detect a small deflection of the atomic beam. 

In 1975, Theodor Hdnsch und Arthur Schawlow suggested the use of laser light for 
deceleration and thus cooling of atoms (Doppler cooling). In the same year, a similar 
scheme was proposed by David /. Wineland and Hans G, Dehmelt, in order to cool 
trapped ions using a laser. The basic idea is as follows: we first consider an atom which 
is moving from right to left and collides with a laser beam coming from the left. The 
frequency of the laser beam is adjusted to lie somewhat below the resonance absorption 
line of the atom. Due to the Doppler effect, the apparent frequency of the photons as 
seen by the atom is increased, so that the atom can absorb an oncoming photon and 
then re-emit another photon in a random direction. If the atom were moving in the op- 
posite direction, in contrast, the apparent frequency of the laser light would be shifted 
even further to the low-frequency side of the absorption line and the resulting light ab- 
sorption would be considerably weaker. This asymmetry between atoms moving to the 
left or to the right causes those moving to the left to be decelerated, while there is prac- 
tically no influence on those moving to the right. In order to cool atoms which are ini- 
tially moving in random directions, Hdnsch and Schawlow suggested that lasers could 
be set up to give opposing beams in each of the three spatial directions. An atom mov- 
ing in such crossed fields can be considered to be a Brownian particle subject to colli- 
sions with the photons. According to the theory of Brownian motion, the particle will 
be decelerated, but will acquire an average kinetic energy, i.e. an average temperature, 
through its interactions with the photon field. A rough estimate puts this temperature 
3i kT= hF/2, where k is the Boltzmann constant and /"is the linewidth of the atomic 
transition. 
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Because of the decelerating property of the photonic medium, it has been called 
“optical molasses”. Initial experiments seemed to confirm the theoretical expectations; 
atoms can in this way be laser cooled so that their kinetic energy is reduced to a small 
remainder, corresponding to a very low temperature. 

For sodium, a temperature of 240 pK was expected; instead, temperatures around 
40 pK, i.e. considerably lower values, were found. They could be determined from the 
spectral distribution of the scattered light (cf. also Fig. 22.14). To understand this 
discrepancy, we need to deepen considerably our understanding of the interactions of 
atoms with light. Furthermore, it was found that the lower temperatures were not a 
result of high laser intensities; on the contrary, the temperature decreased when the in- 
tensity was decreased. Clearly, several additional effects are involved in laser cooling. 

We shall describe here one such mechanism, first suggested by Cohen-Tannoudji, 
without excluding the possibility that additional effects also are involved. This 
theoretical approach starts from the fact that alkali atoms are not simple two-level 
systems, as was assumed in the theory of Doppler cooling. These atoms have several 
Zeeman sublevels in their ground states g, which are mutually degenerate in the absence 
of external fields. If they are irradiated with light of a particular polarization, this can 
excite an electron from a particular sublevel into an excited state, from which the elec- 
tron can return to a different sublevel of the ground state by emitting a photon. By 
means of this process, which belongs to the class of optical pumping methods (see 
Sect. 13.5), the population of the sublevels can be modified. The transition from one 
sublevel to another occurs with a particular rate constant, which is proportional to the 
intensity of the laser light irradiating the atoms. This rate constant, i.e. the number of 
transitions per second, can also be written as the reciprocal of an average transition 
time Tp from one sublevel to another. We shall make use of the time Tp in the follow- 
ing. 

Another important aspect of the theory of Cohen-Tannoudji is the fact that the op- 
tical interaction of an atom with the radiation field can bring about energy shifts. Here 
it is necessary to consider the atom and the radiation field with which it interacts as 
a total system in which a “dressed state” of the atom, i.e. an atomic state plus photons, 
can exist. Due to the interaction of an electron in a particular state with the laser radia- 
tion field, an energy shift d is present between the atomic ground state with a particular 
number of photons and the excited state, with one photon fewer. In addition there is 
another atom-radiation field interaction giving rise to a wave packet from the ground 
state and the excited state, in which (as in spin systems) the Rabi frequency Q plays 
a role. The Rabi frequency has a straightforward intuitive meaning, which we shall first 
explain in connection with spins. If a spin is located in a magnetic field which is con- 
stant in time, it can, as is well known, occupy one of two energy states, parallel to or 
antiparallel to the field. If now a second magnetic field of strength B is applied, which 
oscillates in a direction perpendicular to the static field at a frequency in resonance with 
the energy splitting of the spin states, the spin will flip with the frequency fiB/h 
(ju = Bohr magneton). This is the Rabi frequency. An analogous Rabi frequency is 
found when a coherent oscillating electric field which is in resonance with an electronic 
transition is applied to an atom: the electron undergoes transitions with the Rabi fre- 
quency 0E/h between the two electronic states involved. 0 is the dipole matrix element 
and E is the time-independent amplitude of the oscillating field at the atom. In fact, 
the Rabi frequency, multiplied by is a measure of a particular interaction energy be- 
tween the electron and the oscillating field. Combining the above-described effects, we 
find, as is shown by the theory, that the “dressed states” repel each other; the repulsion 
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decreases with increasing distance between the states when d is large, according to the 
formula Q^/2d, The size of the line shift in the atomic ground state is half of this 
value. The Rabi frequency is proportional to the field amplitude, so that the line shifts, 
like the pump rate l/ip, are proportional to the laser intensity /l. They also depend 
on the polarization of the laser beam and are, in general, different for each Zeeman 
sublevel. 

Finally, this new cooling mechanism depends on the existence of polarization gra- 
dients which inevitably occur in a three-dimensional optical molasses. Here, as an ex- 
ample of the new cooling mechanism, we consider a one-dimensional molasses, in 
which two counter-propagating waves have the same amplitude and linear polariza- 
tions which are orthogonal to each other (Fig. 22.12). In this kind of laser configura- 
tion, there are strong polarization gradients, since the polarization of the total field 
changes continuously over 1/8 of a wavelength A from linear to a (circular polarized 
light rotating in a counter-clockwise sense relative to the +z axis; see Sect. 13.3.3), 
from to linear in the next A/8, from linear to in the next A/8, etc., when the 
atom moves along the z axis in the standing wave. As an example, we take the case 








Fig. 22.12. Line shifts in a polarization gradient, (a) Counter-pro- 
pagating laser beams with mutually orthogonal linear polarizations 
create a total field which alternates from linear to circular polariza- 
tion along each eighth of a wavelength. The upper part of the figure 
shows the energy levels of the atom considered. The numbers on the 
lines connecting the different ground (g) and excited (e) sublevels 
indicate the relative transition probabilities, (b) The energies, 
shifted as described in the text, and the populations of the two 
ground state sublevels of this atom. The energies and populations 
change with the polarization of the light and thus with the position 
of the atom along the beam. The size of the population is symbol- 
ized by the size of the circles. (After Claude N. Cohen-Tannoudji 
and William D. Phillips: Physics Today, pp. 33-40 (1990)) 




Fig. 22.13. An atom moving within the laser configuration of 
Fig. 22.12 climbs out of a potential valley onto a maximum before 
it is optically pumped to the minimum of another valley. On the av- 
erage, the atom experiences more climbs than drops, and the 
resulting net energy loss cools it. The effect is near its optimum con- 
ditions in the case shown here, since the atom moves one-fourth of 
a wavelength in the mean time Tp; this is the average time which the 
atom must wait before it experiences another optical pumping cy- 
cle. (After Claude N. Cohen-Tannoudji and William D. Phillips: 
Physics Today, pp. 33-40 (1990)) 
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of an atomic transition from the ground state g with total angular momentum / = 1/2 
to the excited state e with J= 3/2. Because of the polarization gradients, the popula- 
tions and the energies of the two ground substates depend strongly on the position of 
the atom along the z axis. Consider an atom that is e.g. at rest at the position z = A/8, 
where the polarization is a “ : the absorption of a cr " photon can cause a transition 
from ^2 to 1 / 2 , from which state it can decay to the ground substate g_ 1 / 2 . If the 
atom decays back to the original ground substate g+\/ 2 , it can again absorb another 
G ~ photon and thus has another chance to finally reach the g ~ substate. If, in con- 
trast, a cr“ photon is absorbed by the g-\/i state, this will transfer the atom to the 
e_ 3/2 state, from which it can decay only to g-\/ 2 - It thus follows that in the sta- 
tionary state, the total atomic population will be pumped into the g-^n state. As may 
be shown, the g ~ transition which starts from the g -\/2 state is three times stronger 
than the g ~ transition starting from the g^+1/2 state. Correspondingly, the line shift 
A’_ from g -\/2 is three times larger than the shift A \ from g+\/ 2 - Here, we assume 
that the laser can be tuned to the red side of the transitions so that both line shifts are 
negative. In the case that the atom is located at 3 A/8, where the polarization is cr^, 
the above considerations are exactly reversed. The total population is in s^+ 1/2 and 
now we have A\ = 3zJ . Finally, if the atom is at a position where the polarization 
is linear, for example at z = 0, A/4, A/2 . . . , considerations of symmetry show that the 
two sublevels are equally populated and undergo the same lineshifts. 

All of these results are summarized in Fig. 22.12, which shows how the shifted 
energies and populations of the two ground state sublevels depend on z for an atom 
which is at rest at the position z. Naturally, on the average no force acts on atoms at 
rest, since their population is symmetrically distributed over the maxima and minima. 
If, on the other hand, the atoms are in motion, this symmetry will be disturbed and 
an average frictional force appears. Here, it is essential that the optical pumping pro- 
cess which generates the population distribution requires a finite time Tp. We consider 
for example an atom which is moving to the right and begins at A/8, where the popula- 
tion is pumped to the bottom of the minimum (Fig. 22.13). If its velocity v has just 
the right value so that the atom moves a distance of the order of A/4 during the time 
Tp, the atom will, on the average, remain in the same substate and will climb up the 
potential. At the maximum, it has a maximal probability of being optically pumped 
to the minimum of the potential well. From there, the same series of events can be 
repeated, as shown by the full curves in Fig. 22.12. Due to the time delay ip, the atom 
seems, like Sisyphus in Greek mythology, to be forced to climb the hill again and again, 
converting a part of its kinetic energy into potential energy each time. 

This physical picture shows us that the new cooling mechanism is most effective 
when the atom moves a distance of the order of A during the optical pumping time Tp. 
Thus, a velocity range of the order of = A/Tp, or, equivalently, kv^= 1/Tp, is re- 
quired. Since the optical pumping rate is proportional to the laser intensity /l, the 
range is also proportional to /l and tends to zero when /l goes to zero. In contrast 
to Doppler cooling, the velocity range in which an atom can be captured is independent 
of /l. On the other hand, the coefficient of friction of the new mechanism is large and 
independent of /l, while in Doppler cooling, it is proportional to /l. This important 
property is a result of the fact that when /l tends to zero, the long optical pumping 
times compensate for the smallness of the lineshifts. 

In addition to the “Sisyphus effect” described above, there are other laser cooling 
mechanisms which operate on the basis of polarization gradients, which we can how- 
ever not describe here. Although the new cooling mechanism was first observed with 
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sodium atoms, they are in fact not ideally suited for it; the theory is much more ap- 
plicable to cesium, which has a considerably stronger hyper fine structure. 

Because the photon transfers recoil momentum to the atom, causing it to carry out 
a kind of Brownian motion, one might assume that the recoil energy sets a theoretical 
limit to cooling, of the order of E = h^k^/lM, where M represents the mass of the 
atom. But here, again, this limit can be surpassed by means of another new mechanism, 
called “velocity-selective coherent population trapping”. In this case, the atoms are 
prepared as a coherent superposition of two ground-state sublevels, which cannot ab- 
sorb light: the two absorption amplitudes which lead from these two sublevels have 
completely destructive interference with one another. As soon as the atoms have been 
pumped into such a trapped state, fluorescence ceases. This phenomenon, which is well 
known from earlier experiments, can be employed for laser cooling. To this end, a 
“one-dimensional molasses” is used, in which the two counter-propagating laser beams 
have counterrotating circular polarizations. As can readily be shown (a Doppler shift 
is always present), the trapped state exists only for atoms with vanishing velocities. If 
their velocity is not equal to zero, the interference of the two transition amplitudes 
which depopulate the two sublevels is not completely destructive, and the atom can ab- 
sorb light. The greater the velocity, the higher the absorption rate. In order to populate 
the non-absorbing trapped state, one uses the new atomic velocity distribution which 
accompanies each cycle of spontaneous emission. For an atom which was originally in 
an absorbing state with a nonzero velocity, there is a certain probability that it will be 
pumped into the non-absorbing state with v = 0. When this happens, the atom is 
henceforth “hidden” from the laser light and thus from accidental recoils. In this man- 
ner, all such atoms remain at i; = 0, and we may expect that all the atoms will collect 
in a small velocity interval around v = 0. 

In these considerations, we have assumed that the atoms behave like classical par- 
ticles with respect to their center-of-mass motion. In fact, at such low temperatures 
(|iK) and with the small spatial distances involved, a quantum-mechanical description 
of the atomic translational motion is necessary. It may then be shown that the initial 
state is a linear combination of two atomic states which differ not only through their 
internal Zeeman quantum numbers, but also in their momentum quantum numbers p 
along the z axis. As a result, an initial state is present which is a double momentum 
state. This is confirmed by the experimental results (Fig. 22.14). Interestingly, the 
width of each of the peaks is less than the photon momentum hk, which again verifies 
that the width of the velocity distribution has become less than the recoil limit. The 
one-dimensional critical energy corresponds to a temperature of 2 pK. 




MOMENTUM 



Fig. 22.14. The momentum distribution of atoms in the 
direction of the laser beam, for helium-4 atoms which 
have been cooled by the velocity-selective coherent pop- 
ulation trapping method. The width of each peak is 
smaller than the momentum hk of a. single photon, 
which is an indication of cooling to below the recoil 
limit. The momentum distribution of uncooled atoms is 
shown by the bold-face curve. (Redrawn from A. 
Aspect, E. Arimondo, R. Kaiser, N. Vansteenkiste, and 
C. Cohen-Tannoudji: Phys. Rev. Lett. 61, 826 (1988)) 
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22.7 Nondestructive Single-Photon Detection - 

An Example of Atomic Physics in a Resonant Cavity 



The manipulation of atomic states in resonant cavities has developed into an impor- 
tant area of atomic physics. Here, we give as a particularly nice example the non- 
destructive detection of photons, which was carried out in the group of Serge Har- 
oche. This example will also show just how fruitful the analogy between a spin- 1/2 
system and a two-level atom can be. Normally, when photons are detected, they are 
annihilated by the detector, so that it is impossible to ‘see’ the same photon twice. 
In order to be able to detect a single photon nondestructively, one requires an ex- 
tremely strong interaction between matter and radiation. This can be realised with 
the aid of quantum electrodynamics in a resonant cavity, where the resonant cou- 
pling between a photon and an atom may be made much stronger than all the dissi- 
pative interactions with the surroundings. The experiment described below deals 
with states which have one photon or no photons and could serve as a logical ele- 
ment in a quantum computer (cf. Sect. 23.9). 

In a resonant cavity, only particular standing electromagnetic waves (‘modes’) 
with discrete energies can exist. Such a mode may be occupied by n photons, where 
n = 0,1,2,.... We consider one such mode here, denoted by C, with n = 0 or 
n=\. The atoms we treat here have three levels which are relevant to the experi- 
ments, and these are denoted by e,g and i (Fig. 22.15). 



C 
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i Fig. 22.15. The level scheme of an atom for nondestructive detection of a photon 



The transition between e and g is taken to be resonant with the photon mode C, 
while that between g and i is not. In the initial state, i.e. when an atom has just en- 
tered the cavity, we assume that there is one photon in mode C and the atom is in 
state g. The combined system atom -H photon then undergoes a Rabi oscillation at 
the frequency Q between the states |g, 1) and |e,0), analogous to the spin flips of 
spins in magnetic fields (we use the bra and ket notation here). These oscillations 
are treated in Sects. 15.5.1 and 15.3 as well as in Exercise 22.1. At the time t, the 
system is in a state of coherent superposition: 

If/ — cos(f2r/2)|g, 1) -f sin(f2t/2)|e, 0). (22.10) 

When the interaction time t is equal to In/Q, the atom is once again in the state g 
and the photon is present. However, the wavefunction (22.10) has changed its sign 
from -|-|g, 1) to -|g, 1) = exp(k)|g, 1), in complete analogy to the behavior of a 
spin 1/2 system (cf. Sect. 15.3). On the other hand, if initially no photon is present, 
or if the atom is in the state i, the state of the system remains unchanged. If we in- 
clude the states |/, 1) and |/,0) in the initial state as an additional superposition to 
(22.10), then we find that: 
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The superposed state with one photon 

( 22 . 11 ) 

is transformed into 

Cgexp(i7r)|^, 1) +q|/, 1), (22.12) 

while the state with no photon, 

c,\g,0)+ct\i,0) (22.13) 

remains unchanged. If q = 0 in (2.11), the transition to (22.12) implies only a 
change of the sign (phase) of the wavefunction, which, as we know, cannot be de- 
tected experimentally. In order to detect the existence of the photon, the coherent 
superposition (22.11), with Cg ^ 0 and q 7 ^ 0, is essential. The atom which was 
originally in the state g is therefore subjected to a pulsed, classical oscillating field 
before entering and after leaving the cavity. The frequency u of this field is chosen 
to be approximately resonant with the transition g i. The first pulse produces the 
coherent superposition (cf. Fig. 22.16) 

^(l^) + l*))- (22.14) 

The second pulse, in analogy to the case of a spin, “turns” the superposition 
“further” (Fig. 22.16). 
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Fig. 22,16a-d. Explanation of the behavior of the coefficients c of the wavefunction Cg|g,0) + c,|(, 0) 
using a vector diagram, (a) In the initial state, the atom is in the level g. (b) The first pulse rotates the 
vector (cg,q) by ;r/4. (c) The second pulse rotates the vector by 37t/4 in our picture, so that (apart from a 
sign) the original state g is again obtained. Finally, the atomic state is measured. Important note: in the in- 
terpretation using spins, the rotation angles are twice as large, (d) Temporal sequence of pulses 



For the case of exact resonance between i/ and the transition frequency I'/g, this 
rotation is complete and the original state |g,0) is again present. Variation of 
S = V — Uig changes the probability Pg of finding the atom in its ground state; it var- 
ies sinusoidally with S ■ t. This produces the socalled Ramsey fringes. The final 
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Fig. 22.17a-e. Similar to Fig. 22.16, but with one photon present, (a) In the initial state, the atom is in 
level g. (b) The first pulse rotates the vector (cg,c,) by n/A. (c) The interaction with the photon reflects 
the vector in the /-axis, (d) The second pulse rotates the vector by 3tz:/4, so that (apart from a sign) the 
state / is again reached. Finally, the atomic state - here / - is measured, (e) Temporal sequence of pulses 



atomic occupation states are then determined with a suitable detector. If the mode 
contains one photon, the amplitude Cg (for the atom in g) exhibits a change of sign 
(see above). This is equivalent to reflecting the vector represented by (22.14) 
through the i axis (Fig. 22.17). 

When the second pulse is applied, a rotation by SMtt is again produced, and the 
vector {cg,Ci) comes to rest on the i axis. The atom is thus in the state /, in contrast 
to the situation without a photon. By using models of this type, the behavior of the 
system with detuning, 5 — v - Vig ^ 0, can be understood. In the case of complete 
resonance, v = z//^, a perfect correlation between the atomic state (/ or g) and the 
photon number (0 or 1) is found. 

An alternative method for detection of the atomic states was previously sug- 
gested by V.B. Braginsky and F.Y. Khalili. It is based on a deflection of the atomic 
orbits rather than on interference. In the experiment of Haroche et al, rubidium 
atoms with the principal quantum number n = 50 (level g) and n = 51 (level e) 
were used. The single photon Rabi frequency in the centre of the cavity was 
^2/2n = 47 kHz. 



Problems 

22.1 Treat the interaction between a photon and a two-level atom using the Hamil- 
tonian 

H — E\d\ a\ + E 2 Q 2 a 2 + hcob^b + Ga^aib + Gd\a 2 b^ , (*) 

where E 2 — E\ — Hu. The symbols represent electron creation and annihila- 

tion operators, respectively, and b^^b are creation and annihilation operators for a 
photon. In the present context it is sufficient to treat < 7 + and a analogously to b'^^b. 
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Hint: use the following Ansatz for the wavefunctions; 

i// = ci{t)a^b+(l)Q + C2{t)a^(po, (**) 

where represents the common vacuum state. 

Verify the following rules: 

ai ■ <pQ = b'^ (pQ, a^a\a2(pQ = 0 

0.2 02 ■ 0'\b'^(f>Q = 0 , ^202 00 ^2 00 

0,20\b ■ O'lb^CpQ = 0.2<p(j, a2«l«2 0O'=O 

Oy 02b^ ■ (2^ b^(pQ = 0, 02b'^02 0o = b~^(pQ. 

What do these rules mean? Use Sects. 15.5.1 and 15.3. 

Identify ojb^4^Q with |g, 1) and <22 0o with |c, 0). 




23. Progress in Quantum Physics: 

A Deeper Understanding and New Applications 



23.1 Introduction 



Although quantum theory has proved itself to be a suitable tool for the explanation 
of a seemingly incalculable number of physical phenomena, its basic principles have 
been questioned over and over, even by scientists of undoubted genius such as Ein- 
stein. The resulting discussions have led to a deeper understanding, to the introduc- 
tion of new concepts, and to unexpected new applications. In the present chapter, 
we want to give examples of some of these developments. We begin by recalling 
the interpretation already given in Chaps. 7 and 9, which we shall cast in a some- 
what abstract form. 



23.2 The Superposition Principle, Interference, Probability 
and Probability Amplitudes 



Let us consider the example of a particle with the spatial coordinate r, whose wave- 
functions ^(r, t) obey a time-dependent Schrodinger equation. Then this wavefunc- 
tion may be represented as the superposition (or linear combination) of a complete 
set of orthonormalised basis wavefunctions (pj{r)J = 1,2...: 

(23.1) 

j 

where the (time-dependent) coefficients Cj{t) are termed the probability amplitudes. 
According to the probabilistic interpretation of quantum mechanics, jc;(t)j^ repre- 
sents the probability of finding the particle at the time t in the (quantum) state j, if 
we carry out a measurement of the corresponding quantum states. A special case of 

(23.1) is found when each individual summand in (23.1) is itself a solution of the 
time-dependent Schrodinger equation. 

In order to work out the difference between the rules of calculation (and the cor- 
responding interpretation) of quantum mechanics with respect to those of classical 
physics, we consider the special case of only two wavefunctions 0 i, 02 i which 

(23.1) reduces to: 

y/{r,t) = ci0i(r) + C202(O- (23-2) 

To compute the probability of finding the particle at the point r, we apply (23.2) to 
calculate 
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Fig. 23.1. (a) A Galton 
board with three pins, 
(b) Probabilities add 



kih0i(r)p + |c2n02(OI^+cIc20t(r)02W+C2Ci0Kr)0i(r). (23.3) 

We will illustrate this relation with the example of the double-slit experiment, in which 
an electron beam passes through two openings. Let <pi{r) be the wavefunction corre- 
sponding to the passage of the electrons through slit 1 and 02 (^) ^^at corresponding to 
slit 2. Analogous to wave optics, the terms in the second half of (23.3) represent the 
two-slit interference of the electron waves. For example, if we close slit 1, i.e. if we 
block that ‘path’, then the interference pattern disappears. However, in order to de- 
stroy the interference pattern in the quantum mechanical case, it is not at all necessary 
to block one of the slits. As has been demonstrated by sometimes very subtle experi- 
ments, it is sufficient to obtain knowledge of which electrons passed through which 
slit. In these experiments, the passage of the electrons is registered, e.g. by using cav- 
ity resonators. In order to detect the interference pattern, one naturally has to repeat 
such experiments many times, leading to a statistical average. 

If we now compare quantum mechanics with classical physics, we can carry out 
a classical analogy to the double-slit experiment. We consider a Galton board (an 
early precursor of the pinball machine) with three pins (cf. Fig. 23.1). 

We now allow balls to fall down onto the upper pin, causing the ‘particle beam’ 
to split into right and left beams; chaos theory tells us that it is classically impossi- 
ble to predict their trajectories exactly. When the balls hit pin 1, they will be scat- 
tered, leading to an impact probability on the x-axis of p\{x). Correspondingly, we 
have P 2 {x) for pin 2. If both pins 1 and 2 scatter the balls, the impact probability is 

p{x) = Pi{x) +P2(x). (23.4) 

If we compare this result with (23.3), we can see the basic difference between clas- 
sical and quantum physics: in the case of the latter, interference terms occur; in the 
former, they do not. This also indicates how one may pass from a quantum descrip- 
tion to a classical description by removing the interference terms. An important ex- 
ample is that of phase fluctuations, to which we will return later in the section on 
decoherence. 

Using the bra and ket notation, (23.2) may be written in a more abstract form: 

|^)=ci|1)+C2|2). (23.5) 

In this expression, |1) and |2) may indicate position-dependent wavefunctions as in 
(23.2), or they may be spin wavefunctions | |) and | |) with spins ‘up’ | T) or 
‘down’ I i), or they may indicate single-photon states with horizontal | or verti- 
cal I I) polarisation directions. From this we can see how broad the applicability of 
the quantum-mechanical formalism can be. In order to extend the interpretation of 
the double-slit experiment to these general cases, we require a formal trick, since a 
characteristic quantity corresponding for example to the position r does not occur in 
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(23.5). Instead of considering a measurement of the position r of a particle, we 
speak more abstractly of the measurement of a quantum state which we indicate 
in the bra and ket notation by |(//^). The probability (when a wavefunction |t//) and a 
following measurement of the quantum state are considered) of finding a par- 
ticular state 1^^^) is ^iven according to the basic rules of quantum mechanics (cf. 
Chap. 9) by \{WkW)\ • Using (23.5), we compute 

(23.6) 

and obtain 

\{^>kW)\^ = + k2h(v/t|2)|^ 

+ c\c2{l\Wk){¥k\^) + 4(^i{'^\¥k)i¥k\'^)- (23.7) 

The terms in the second line are clearly the interference terms. For specific exam- 
ples, see Exercise 23.1. 



23.3 Schrodinger’s Cat 



Schrodinger, to whom we owe the very fundamental Schrodinger equation, had prob- 
lems with the interpretation of the quantum theory, in particular concerning the mea- 
surement process. In order to illustrate them, he invented the following thought experi- 
ment: we put a cat into a box, where we cannot see it, so that we cannot determine the 
quantum state of the cat. In addition, the box contains a radioactive atom, whose decay 
will release a poisonous gas and kill the cat. According to quantum mechanics, the 
moment of the radioactive decay is not predictable (rather only the decay probabil- 
ity). We thus cannot determine at a particular time whether the cat is alive or dead, 
and have to describe its state quantum-mechanically by a wavepacket 

1^) = Cl jalive) + C 2 |dead). (23.8) 



This wavepacket is thus a coherent superposition of the two states jalive) and 
I dead). Only when we open the box and look inside, and thereby in the language of 
quantum mechanics ‘measure the quantum state of the cat’, will the wavepacket of 
(23.8) be reduced to one of the states jalive) or jdead). We may make at this point 
however the justified objection that the “quantum state” of the cat cannot depend in 
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the interpretation of quantum mechanics seems to be present. How this contradiction 
can be resolved is the subject of the next section. 



23.4 Decoherence 



Although the solution of Schrodinger ’s riddle has long been clear to insiders in 
quantum mechanics, who have studied e.g. spin relaxation or the quantum theory of 
the laser, it has only gradually become known to the broader physics community. 
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Im Ci*C2 



Im Ci*C2 




Fig. 23.2. Spreading of the phase angles. In the end, the vectors mutually cancel 



As an example, we consider a magnetic moment of spin 1/2 which is processing in 
a constant magnetic field (cf. Sect. 14.2). We can then follow the time develop- 
ment of e.g. the expectation value for the spin operator in the x-direction. To this 
end, we use the wavepacket (23.5), i.e. 

1^^) — ^i \ T) + <^ 2 ! i), (23.9) 

to compute the expectation value 

{ll/\Sx\y/) = |ci|^(t l^jcl T)+ 1^2 I^U kjc| i)+ c\c2{] i)+ r*2^l(i 1*^x1 T)- (23.10) 

We have already met up with the explicit result in Sect. 14.2.4. Here, a different as- 
pect interests us, namely the behavior of the interference terms CiC 2 ,C 2 Ci. In fact, 
the spin is continually influenced by its surroundings, called “heat baths” in the lan- 
guage of thermodynamics. As is shown by the corresponding theory, the coupling of 
spins to a heat bath leads in particular to fluctuations of the phases ^he 

coefficients c\ and C 2 , which we therefore write in the form 

Cl = ax exp(ixi), C 2 = exp(i/ 2 )- 

2 

In the course of time, these fluctuations continue to increase. If we calculate e.g. \cj\ , 
the fluctuations cancel out. However, if we calculate c\c 2 and average over the phases, 
the phase average of c\c 2 tends towards zero with increasing time (cf. Fig. 23.2). 

In other words, the interference terms disappear and the quantum-mechanical de- 
scription is converted into the classical one. If we now consider a macroscopic ob- 
ject, such as a cat, it is not only subject as a whole to the influence of its surround- 
ings, but also its parts, again macroscopic, act on one another as heat baths and de- 
stroy the definite quantum-mechanical phase relations. Even without the intervention 
of an observer, i.e. without a measurement, the coherent wavefunction of the cat de- 
cays rapidly. If one cools non-hving objects (including their surroundings) to very 
low temperatures, then the heat baths may not be strong enough to destroy the quan- 
tum coherence and macroscopic quantum phenomena may occur, such as supercon- 
ductivity, superfluidity, or a Bose-Einstein condensation (see Sect. 23.11). 



23.5 Entanglement 



This term was introduced by Schrodinger (1935). If we consider two quantum sys- 
tems, e.g. two particles or two spins or two photons, and the two systems are not 
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coupled, then their total wavefunction is the product of the wavefunctions of the in- 
dividual systems. Thus, in the case of spins, we have 



Ti) ■ 1 12 ) = 


1 Tli2)) 


(23.11) 


ii) ■ 1 T 2 ) = 


1 ilt2)- 


(23.12) 



The subscripts serve to enumerate the spins and the right-hand sides of (23.11,12) 
can be considered as abbreviations for the left-hand sides. In the following, we 
leave off the subscripts. As we have seen in several examples in this book, a cou- 
pling between the subsystems produces a linear combination of (23.11,12), so that 
the total wavefunction has the form: 

k)=^(ITi)±UT))- (23.13) 

Such combinations of product wavefunctions (even when the product functions | t|) 
and I It) have general coefficients) are known as “entanglements” or entangled 
states. Since the spins are located at fixed positions r\, t2, their position dependence 
can be expressed explicitly by including their position functions ;^(ri,r 2 ) to obtain 

k) = ;^(l Ti) ± I (23.14) 

The “preparation” of such entangled states in systems consisting of several subsys- 
tems, e.g. spins, is in general an experimental problem requiring considerable skill. 

Why are these entangled wavefunctions so interesting? We will discuss the an- 
swer to this question in the next sections. 



23.6 The Einstein-Podolsky-Rosen (EPR) Paradox 



Let us look more closely at the wavefunction (23.14) and imagine that we have 
measured the z-component of the first spin. If we find that the spin is ‘up’, then we 
have verified the wavefunction | Ti)- This means not only that spin 1 is ‘up’, but 
also that spin 2 must be ‘down’ - without our having measured it! Or if we do mea- 
sure its direction, then the result must be |, even though the two spins may be arbi- 
trarily far apart. The result is even more general, since one can measure the spin in 
an arbitrary direction. A short calculation then shows (see Exercise 23.2) that if the 
one spin points in a certain direction, the other must point in the opposite direction 
- again independent of their separation. It appears here as though information is 
transmitted at velocities greater than that of light, a point which seemed to Einstein, 
the founder of relativity theory, to be highly suspicious. This gave him another rea- 
son (in addition to his opinion that “God does not play dice with electrons”) not to 
accept the quantum theory. He proposed the idea that the quantum theory is incom- 
plete, and that there must be hidden variables behind it. The whole subject is in fact 
much more subtle and the EPR- ‘Paradox’ is central to quantum mechanics and to a 
possible quantum computer. Intensive theoretical and experimental efforts have been 
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made to investigate the EPR effect more closely, and we shall discuss some of them 
in the following sections. In particular, photons have been included in these investi- 
gations. 



23.7 Bell’s Inequalities and the Hidden- Variable Hypothesis 



In consideration of the proposal by Einstein, Podolsky and Rosen that quantum theory 
is incomplete, John Bell made an attempt in 1964 to extend it within the framework of 
a local, causal theory by including as-yet-unknown ‘hidden’ variables X. In the process, 
he derived inequalities which permit an experimental distinction to be made between 
the validity of the quantum theory and that of the local, causal theory. In the following, 
we derive a version of Bell’s inequalities which is due to Clauser, Home, Shimony and 
Holt. 

The basic experimental setup is shown in Pig. 23.3. 

A (correlated) photon pair is created by atomic two-photon emission or by a non- 
linear optical process (see below). The polarisation of each of the photons is mea- 
sured by spatially separated observers A and B. These observers switch their polari- 
sers randomly back and forth between two mutually orthogonal directions ai ,«2 or 
P 1 J 2 Por each orientation of the polarisers, only two results are possible: a polari- 
sation horizontal relative to the unit vector chosen (denoted by -i-l) or vertical (de- 
noted by —1). Let us now investigate the consequences of the socalled local realistic 
theory (LRT) of hidden variables for particular results of the measurements. Accord- 
ing to a theory of this type, the physical state of the two photons delivered by the 
source (or, more generally, of two particles) must be completely described by the ‘hid- 
den’ variables. If the source is subject to fluctuations, it will produce particles of dif- 
ferent X. This is taken into account in the theory by introducing a probability distribu- 
tion P{X). The latter is normalised as usual, so that 

Jp(A)£a=l. (23.15) 

In this theory, the values which are measured by the two observers can depend only 
on their own measuring apparatus (e.g. the chosen direction of polarisation) - this 
property is locality. The measurements which are read off by an observer are deter- 
mined a priori as soon as the corresponding value of X is fixed {reality). The results 



Polariser A 



Polariser B 






Atom emits two 
entangled photons 




Fig. 23.3. Schematic setup of the photon correlation experiment 
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of measurements by A or B thus depend only on the hidden variables X and on their 
own measurement apparatus. In particular, the results of one observer must be inde- 
pendent of the choice of polarisation direction by the other observer, in contradic- 
tion to the prediction of quantum mechanics for entangled states. The crucial experi- 
ment is thus the correlation between the events observed by A and B. For a series 
of emissions of photon pairs, A and B simultaneously determine the polarisation di- 
rections of the individual photons. We define the measurement quantity a\ for obser- 
ver A when his polariser is in the direction 0 |. In the sense of the local theory of 
hidden variables X, a\ is determined by X and has, depending on the horizontal or 
vertical polarisation of the photon, the value +1 or -1, so that a\{X) = ±\. Analo- 
gously, the polarisation direction «2 has the possible measurement values 
a 2 {X) = ±1. For observer B we introduce corresponding measurement values 
b\{X) = ±\ for the polarisation direction and b 2 {X) = ±1 for ySj- From the com- 
plete series of measurements from all the emission events, we select those for which 
the directions a\ and were chosen. We then multiply the two measurement values 
a[{X) and b\{X) for the same value of X, i.e. we compute ai{X)b\_{X) and take its 
average over the entire selected series of measurements. In the limit of a very large 
number of measurements, the local realistic theory of hidden variables yields, in- 
cluding the probability P{X) for the hidden variable X, the following expression: 

{a\b\) = J P{X)ai{X)bi{X)dX. (23.16) 

Expressions of this type are called correlation functions. We can naturally also com- 
pute the correlation functions of the other combinations of polarisation directions: 

(«1^2), («2^l), (02^2)- 

The trick which lies behind Bell’s inequalities is the choice of a linear combination 
of these correlation functions which can be estimated in such a way as to allow an 
experimental distinction to be made between local theories of hidden variables and 
quantum theory. To this end we calculate the following quantity: 

K = \{a\bi) -f (< 22 ^ 2 ) + {nib\) — {a\b 2 )\- (23.17) 

Since the correlation functions have the form (23.16), the above expression can be 
written as 



K = 



J P{X)[ci\[X)bi{X) a2{X)b2{X) a2{X)bi(X) — a\{X)b2{X)]dX . (23.18) 

The quantity in square brackets can be either negative or positive, so that 

K< J P[X)\[a\b\ 02 b 2 ci 2 b\ — a\b 2 ]\dX. (23.19) 

If we write the expression in brackets from (23.19) in a somewhat different form, 
we can see immediately that 



|(ai -|- 02)b\ -f («2 ^ d[)b2\ — 2. (23.20) 

From (23.15)-(23.20) we can directly derive Bell’s inequality in the formulation of 
Clauser, Home, Shimony and Holt, which we give explicitly: 
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l(flifci) + {a2b2) + {a2h) - {aib2)\ < 2. (23.21) 

It is important to note that all the expressions in (23.21) can be determined experi- 
mentally; their 2-dependence is only fictitious and the averages are carried out over 
the actual measured values aibj in (23.21). 

Let us now turn to the quantum-mechanical evaluation of the expressions in 
(23.21). As we know (cf. Chap. 9), classical quantities are associated with operators 
in quantum theory. Since each photon may take on only one of the two measured 
values ±1 depending on its polarisation direction, and since it represents a quan- 
tum-mechanical system, we can use the spin matrices in our formulation of the op- 
erators. The measurement of a polarisation state with the measured values ±1 thus 
corresponds to the measurement of a spin state with its projection in the correspond- 
ing polarisation directions. We use the Pauli spin matrices: 

and compute with the help of the polarisation vectors a 1,02 and P\,fi 2 ^he operators 

aj = ajtT, bj = Pjo^, 7= 1,2. (23.22) 

It is now easy to define and compute the quantum-mechanical averages correspond- 
ing to {ajbj). To do this, we must replace the classical measured values aj^bj by 
their operators (23.22) and compute the quantum-mechanical expectation value 

WkW)- (23.23) 

It must then be calculated for an entangled quantum-mechanical state, e.g. of the 
form 

I V') = (I I)/1 1 ^)b ~ I ^)a I I)b) • (23.24) 

We identify the vertical polarisation direction of the photon, | |), with the spin state 
I t) and correspondingly | <->) with the spin state j |). We leave the calculation of 
(23.23) and (23.24) to the reader as an exercise (Problem 23.3) and just give the re- 
sult here: 

{y/\hibj\ii/) = -ttiPj. (23.25) 

If we choose the polarisation directions in such a way that 

{a\,P\), (aiJi), («2,y?2) (23.26) 

each enclose an angle of ;r/4, we obtain 

- {a\b2)gj = 2V2 > 2. (23.27) 

The replacement of the quantum theory by a local theory would be allowed only if 
the inequality (23.21) were obeyed. This can now be tested experimentally. 
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23.8 Experiments to Test BelFs Inequalities 



Experiments of this type have been carried out by a number of authors (cf. Biblio- 
graphy). The ingredients which are necessary for such an experiment may be sum- 
marised as follows: 



1) A source which emits pairs of correlated particles, each of which moves in a dif- 
ferent direction. 

2) Each particle enters an apparatus which analyses the correlated properties and at- 
tributes to the result a binary value (e.g. ±1). 

3) The operation of each apparatus is controlled by a switch which determines the 
parameter d\ (or S 2 ), e.g. the phase shifts of interferometers (for energy-time cor- 
relations) or the orientation of polarisers (for polarisation correlations). 

4) The classical information about the detection of a particle, i.e. the time and place 
of its detection, is then sent to a coincidence circuit which measures the number 
of temporally correlated events, Rij{Si,S 2 )jiJ = ±1. /?+_, for example, denotes 
the coincidence counting rate between + at apparatus 1 and — at apparatus 2. 
This allows the correlation coefficient 



E{SuS2 



+ R — — R ^ — — R — ^ 
/?++ + R — + R ^ — + R — ^ 



(23.28) 



to be determined, where R(j = Rij{3 1 ^ 62 ). If four analysers are employed, two on 
each side of the source, then a Bell’s inequality in the Clauser-Home-Shimony- 
Holt (CHSH) form: 



5 = E{dx , J 2 ) + E{di , 4) + \E{d [ , d2) - E{d[ , 4) I < 2 (23.29) 



[which is a generalisation of (23.21)] would hold if a “local hidden variables the- 
ory” is valid. Here, di^d'-{i =1,2) are the parameter values (e.g. phases). 

An experiment of this type was carried out in the laboratory of W. Tittel et al. (cf. 
Bibliography), and it contradicted the inequality (23.29), i.e. the assumption of local 
hidden variables. These authors also carried out spectacular experiments with three 
widely separated analysers, which also contradicted the theory of local hidden vari- 
ables. In this case, the source of energy-time entangled photons (see below) was in- 
stalled in a telephone station in Geneva. One of the photons travels through 8.1 km 
of installed optical communications fibre to an analyser in Bellevue, 4.5 km north 
of Geneva. Using a second installed fibre channel of 9.3 km length, the other 
photon is sent to a second analyser in Bemex, 7.3 km southwest of Geneva and 
10.9 km from Bellevue. The absorption in the fibres is 5.6 dB and 4.9 dB, respec- 
tively, which leads to an overall loss of coincidence counting rate by a factor of 10. 
The analysers are fibre-optic interferometers with identical differences in optical 
path. At the output of the interferometers, the photons are detected and the now 
classical signal information is returned to the source, where the coincidence electro- 
nics are located. The results of the measurements made with the various analysers 
are finally eompared in order to confirm the nonlinear correlations. 

A final word concerning the source used in these experiments: it consists of a 
pump laser (a commercially available laser diode) and a nonlinear optical crystal 
(KNbOa), in which the ‘signal’ photon and an ‘idler’ photon are generated pairwise. 
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All the experiments which have thus far been performed verify (23.27) and there- 
by the prediction that correlations of entangled states are not describable by local 
theories with hidden variables. The current efforts of the experimentalists are direc- 
ted towards closing two loopholes: 

1) According to the assumption of locality, the measurement results of observers A 
and B can depend only on the hidden variables of the photon pair and on the 
choice of their local polarisation direction. An exchange of information between 
A and B thus has to be excluded. In the experiments carried out in 1998 by Zei- 
linger and coworkers, A and B were at a distance of 400 m and the polarisation 
directions were chosen by random number generators. Each individual measure- 
ment took less than 1.3 ps. Passage of a signal from A to B, even at the velocity 
of light, could thus be excluded. Therefore, the CHSH inequality was violated in 
a convincing manner. However, the detector efficiency was only 5%. 

2) There is a second weak point of the experiments to date: it is possible to con- 
struct a model of a local theory with hidden variables (to be sure in a rather arti- 
ficial way), in which the CHSH inequality is maximally violated. In order to ex- 
perimentally rule out this argument, a detector efficiency of 82.8% is required. 
Various groups are preparing experiments using entangled atomic states to make 
such a test. 

Finally, we note that an experiment is in preparation which uses three entangled par- 
ticle states in an attempt to test the assumption of a local theory of hidden variables. 
It makes use of a socalled Greenberger-Home-Zeilinger state, 

M ~ ~ I i)Al 1 ) b \ i)c)’ 

which can be realised by using nuclear spins in a molecule or by using photons. 
The loopholes are thus gradually being closed. The trend has reversed: the assump- 
tion of nonlocality has been made the basis of a new type of computer, the quantum 
computer. 

Let us close our brief excursion into this type of basic research with a short phi- 
losophical discussion of why two quantum systems which are originally correlated, 
so to speak at the same spatial position, can retain their correlation even when they 
are widely separated from each other. If the one particle reveals its - by no means 
predictable - quantum state, e.g. its polarisation direction, then that of the other par- 
ticle, which may be at a great distance from the first, is instantaneously determined. 
It seems that the correlation is based on conservation laws, e.g. overall spin equal to 
zero. Nature does play dice, but in such a way that the conservation laws are respected. 



23.9 Quantum Computers * 

23.9.1 Historical Remarks 



The marriage between quantum physics and computer science is not yet very old. In 
1982, Richard Feynman formulated a first quantum computer. At the same time, he 
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recognised that the simulation of a quantum computer using a classical computer 
would require an uncontrollably long time. This led him to the assumption that 
quantum computers could accelerate the calculational process so greatly that certain 
computer problems currently termed “hard” could be solved with them. In 1985, Da- 
vid Deutsch carried this idea further and found that a quantum computer could work 
in a massively parallel fashion by operating on a very large number of classical 
states. Thus, a single computation which consists of 300 qubits (see below) could 
have the same effect as 2^^^ simultaneous calculations based on classical bits (see 
below), i.e. more than the number of atoms in the known Universe. 

Finally, in 1994, Peter Shor discovered how the quantum computer could be 
used to solve a mathematically “hard” problem, the computation of the prime-num- 
ber factors of a very large number. In order to factor a number with 400 places 
using current methods, a network of hundreds of powerful workstations would re- 
quire ca. 10 billion years (of the order of the age of the Universe), while a quantum 
computer would need only minutes. The suggestions mentioned here are theoretical 
concepts. The further development or new development of such concepts must go 
hand in hand with their material implementation - we will return to this point later. 



23.9.2 Review of Digital Computers 

In order to better understand the concepts of a quantum computer, we recall some 
of the fundamentals of currently used computers, with which most readers will be 
familiar. Two ingredients are necessary for a computer: a data set and a programme. 

a) The data are represented digitally in the form of a sequence of zeros and ones 
(“binary system”), for example 10010. Each place in such a sequence represents 
a possible choice between 0 and 1. One such possible choice corresponds to 1 
bit. A sequence with n places corresponds to n bits. If we list all the possible 
ways of realising a sequence with n places, we find 

2 X 2 X ... X 2 = 2'^ 

V " 

n 

possibilities. 

b) Programme and implementation: the programme has to process the stored data 
collectively in a prescribed manner, e.g. to add certain data together, multiply 
them etc.; it then has to store them in a memory or carry out further computa- 
tional steps. There are correspondences between computational steps such as ad- 
dition and multiplication on the one hand and logical relations such as ‘or’ and 
‘and’ on the other. The relations follow the rules of Boolean algebra. The pro- 
gramme runs on a material substrate, i.e. the individual computational steps are 
realised by certain switching elements, which at present are almost entirely elec- 
tronic. For example, the zero may correspond to the resting state, the one to the 
active state of a switching device, also known as a gate. Multiplication of two 
bits then follows the scheme shown in Table 23.1: 
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Table 23.1. Multiplication scheme for two bits (leftmost column and upper row) and its result (box at 
lower right) 





0 


1 


0 


0 


0 


1 


0 


1 



Every digital computer can be constructed from primitive switching elements 
with one or two inputs, which are connected together by data lines. A particular- 
ly important switching element is the “exclusive or” (XOR), whose table is given 
by the following scheme: 

Table 23.2. The XOR operations 





0 


1 


0 


0 


1 


1 


1 


0 



Another operation is negation, which makes a zero into a one and a one into a 
zero: 

0 -> 1 
1 ^ 0 

All together, the operation of a digital computer can be summarised as follows: a 
set of input data is converted into a set of output data. 



23.9.3 Basic Concepts of the Quantum Computer 

Again, it is expedient to distinguish between the data set and the programme, whereby 
the material implementation now has to be considered with special regard to quantum 
physics. 

a) Data set: instead of bits, i.e. the possible choice between 0 and 1, we have qubits 
(quantum bits). The meaning of this concept can best be seen with the example 
of a spin 1/2 system. Its wavefunction y/ can be represented as 

\y,>=a\^)+l]\i) (23.31) 

where |ap + \p\^ = 1. The possible choices in (23.31) form a qubit. Speaking ab- 
stractly, a qubit corresponds to a 2-dimensional Hilbert space. (Reminder: a Hil- 
bert space is spanned by a set of orthonormalised vectors ejJ= l,...,n and is 
formed by all the possible linear combinations of these vectors with complex 
coefficients, where the linear combinations are normalised to 1; cf. Exercise 
23.4). This makes many kinds of realisations possible, e.g. by using atoms with 
two levels, with photons, with nuclear spins, etc. A sequence of bits now corre- 
sponds to a sequence - or more precisely, an ensemble - of qubits, which could 
be realised in the form of spins, for example. In consideration of the material 
which follows, we need to be familiar to some extent with the mathematical re- 
presentation of such an ensemble, which corresponds to a physical state with a 
wavefunction. We shall illustrate this by using spins - but any other quantum- 
mechanical two-level system would do as well. For one qubit, the states can be 
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described in terms of wavefunctions of the form | |), | |) or also as a| |) + |) 

with \a\ + — 1. More interesting is the case of two qubits. In the simplest 

case, we multiply the wavefunctions of the two spins 1 and 2, for example 
0(1) • 0(2). In the bra and ket notation this is then represented by | |) • | T)> 
which we can abbreviate as | t|). More generally, we have for example 

(«i| T)+Ali))-| T), (23.32) 

which, when multiplied out, gives 

«i| TT)+ySil it). (23.33) 

It is clear that the four states 

I TT),I Ti),UT),l U) (23.34) 

form a basis, from which all possible initial states with 2 qubits can be con- 
structed by taking linear combinations of the form: 

Cll TT)+C2| U)+C3| it)+C4| U). (23.35) 

In particular, entangled states belong to this set (cf. Sect. 23.5). This scheme can 
naturally be extended to arbitrary numbers of qubits. For three qubits, the linear 
combinations of | ttt), | TTi)> I Tit)> etc. come into play. Here, the first great 
challenge to quantum physics becomes apparent: the preparation of suitable ini- 
tial states as input data. 

b) Programme and implementation: as in the case of the digital computer, we can 
again show that the programme may be divided up into elementary steps. It is 
sufficient for each step to require one input, i.e. a qubit, or two inputs, i.e. two 
qubits. A decisive point for the whole concept of the quantum computer is that 
all the steps be carried out in a quantum-mechanically coherent manner. What 
does this mean? A quantum-mechanical initial state, represented by a wavefunc- 
tion f//^(0), which is the solution of a Schrodinger equation, is to be converted 
into a new wavefunction y/{ti) after a certain time t\. A concrete example would 
be a spin flip, where a spin initially in the “up” state (corresponding to a 1) is 
flipped into the “down” direction (corresponding to a 0), or vice versa. The asso- 
ciated switching element would thus correspond to a logical negation. If we use 
two inputs, i.e. two qubits, then we can imagine two spins in particular states (as 
input) which interact with each other, giving rise after a certain time to a new 
spin state (the output). Since however all possible input states (not only 0 and 1 
as in the case of digital computers) can occur, a large number of possibilities of- 
fers itself. Speaking abstractly, all the switching operations are equivalent to uni- 
tary transformations. (Reminder: a unitary transformation corresponds to a rota- 
tion and/or reflection of vectors within a Hilbert space.) The transformation ma- 
trices U = (ujic) and the associated matrix W= {u*f) have the property 

^ ^ ^kk' ■ 

j 

In summary, we can make the following statement: a quantum computer can be con- 
structed from unitary transformations involving 1 and 2 qubits with data lines 
(“wires”) connecting the switching elements (gates). The quantum-mechanical co- 
herence must not be lost in these wires. 
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Let us consider some examples: the logical negation corresponds to the unitary 
transformation 






0 1 
1 0 



since: 

u\ = 

and 

U\ 



0 

1 



i) 



1 

0 



!)• 



(23.36) 



(23.37) 



An important 2 qubit gate is the socalled quantum-controlled NOT gate, which 
causes a quantum “exclusive or”. What does this mean? 

This gate, or switching element, flips the second of two qubits, if the first corre- 
sponds to 1, otherwise it leaves the second qubit unchanged (cf. Exercise 23.5). The 
corresponding unitary transformation thus has the following properties: 



C/101) = |01) 
C/|00) = 100) 
U\10) = \U) 
U\U) = \10), 



(23.38) 



As a brief exercise, the reader can express this relation in terms of spin functions 
(1 tt) etc.). The obvious task for quantum physics is, first of all, to find quantum 
systems which can carry out the functions of such gates. There have been a large 
number of suggestions and realisations and we can give only a few examples here. 
In quantum optics, single atoms can be captured in traps and cooled. With suitable 
laser pulses the internal states of these atoms can be manipulated in a controlled 
way. Using the methods of resonant-cavity electrodynamics, the interaction of single 
photons with atoms can be coherently controlled. With such methods, both 1 -qubit 
and 2-qubit gates can be constructed; cf. Sect. 22.7 and Exercise 23.5. Further 2-qu- 
bit gates are possible through the use of additional degrees of freedom, e.g. collec- 
tive vibrational modes of the atoms or ions in a resonant cavity. In nuclear spin res- 
onance, gates are realised by irradiation with radiofrequency pulses (cf. Chaps. 14 
and 20 of this book, and also our book Molecular Physics and Elements of Quan- 
tum Chemistry). In this way, simple quantum algorithms were first demonstrated in 
the laboratory. 

Further potential systems are expected from solid state physics, e.g. Cooper pairs 
in Josephson contacts or the spin states of electrons in quantum dots as carriers of 
qubits. This is, all in all, a fascinating new area of quantum physics. 



23.9.4 Decoherence and Error Correction 

As we noted in Sect. 23.4, every physical system is subject to the influence of its 
surroundings, leading in the case of quantum systems to a loss of coherence, as 
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mentioned there. This is true also of quantum computers. When quantum-mechani- 
cal coherence is lost, the great superiority of the quantum computer relative to the 
digital computer vanishes. At first, the situation seemed practically hopeless. How- 
ever, in 1995, Peter Shor and Andrew Steane discovered that quantum-bit error cor- 
rection is in fact possible. We can give only one example here, in which the error 
consists of the flipping of one single bit. We wish to protect the coherent superposi- 
tion 

a|i)+yS|T). (23.39) 

When the error occurs, (23.39) is changed to 

«|T)+yS|i). (23.40) 

Our task is to recognise this spin flip and to reverse it without disturbing the deli- 
cate superposed state, i.e. without changing a and p. Analogous to classical error 
correction, we introduce redundance into the system, by coding | J,) using three qu- 
bits as I ill) and | T) as | |||). Then (23.39) becomes 

«IIU)+y9|TTT). (23.41) 

If now a 1 bit error occurs, (23.41) is changed to 

«|TII)+i?|ITT). (23.42) 

As observers, we cannot directly measure the two states in (23.42), since that would 
destroy their coherent superposition. Instead, we need to make a collective measure- 
ment (see below) on two qubits at the same time: we must answer the question, “Do 
the first two spins have the same direction?” - without measuring them individually. 
In our example (23.42), we would find that the spins are in opposite directions. We 
then determine whether the second and third qubits are in the same or opposite di- 
rections. From these two measurements we can conclude that the first qubit was 
flipped relative to the other two and must now be flipped back in order to correct 
the error. 

To give a complete account of our example, we need to discuss the collective 
measurement in more detail: we want to find out whether two spins are in the same 
or opposite directions without detecting the spins individually. To this end we use 
successively two controlled NOT gates (see (23.38), above). We first prepare an an- 
cillary qubit (from the Latin “ancilla” = maid) in the state | |). Then the two gates 
act, whereby the “maid” is the target and the two qubits are the controllers, one 
after the other. If the qubits have the same direction, the “maid” qubit will either not 
be flipped at all or will be flipped twice, so that its direction will in either case be 
maintained; if, on the other hand, the directions of the two qubits are opposite, only 
one flip of the “maid” will occur. By detecting the “maid”, we can determine the 
collective property (parallel or antiparallel) of the two qubits. The “maid” is an es- 
sential part of quantum error correction, since it is used as a sink for the entropy 
which is introduced into the code block by the error: the ‘maid’ is warmed, while 
the protected quantum system is cooled. To protect quantum information over long- 
er periods of time, there must be a continuous supply of fresh ancillas. To get a feel- 
ing for the physical realisation of such error correction procedures, we recommend 
to the reader Exercise 23.6. 
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23.9.5 A Comparison Between the Quantum Computer 
and the Digital Computer 

At the beginning of Sect. 23.9, we mentioned the incomparably higher computing 
speed of the quantum computer. This, however, depends to a great extent on the 
problem to be solved. Some tasks can be performed exponentially faster, general 
search tasks only quadratically faster, and socalled blackbox functional iteration only 
linearly faster. 

In summary, we note that the increase in computing speed of the quantum com- 
puter is due to the fact that it makes use of entangled states during the computa- 
tional process. 



23.10 Quantum Information Theory 



In the previous section, we considered the quantum computer as one important de- 
velopment in the area of quantum information theory and technology. Other impor- 
tant topics are quantum cryptography, the transmission of coded information, and 
teleportation, where the quantum state of a particle is transferred to a distant particle 
by means of the Einstein-Podolsky-Rosen “paradox”. The “paradox” is in this case 
not employed as an argument against the validity of the quantum theory, but rather 
its effect is the essential basis for new techniques of information transfer. 



23.11 The Bose-Einstein Condensation 



23.11.1 Review of Statistical Mechanics 

When a system of non-interacting or weakly interacting particles is in thermal equi- 
librium at a temperature T, the particles distribute themselves amongst the various 
energy states. This is described by distribution functions f{E). If the particles are 
distinguishable, then the Boltzmann distribution holds (cf. also (2.8)). The probabil- 
ity of finding a particle in a particular state of energy E is given by: 

/(£) = Z(£)-‘ exp [-E/kT], (23.43) 

where Z is the partition function and k the Boltzmann constant. In quantum 
mechanics, particles such as electrons or photons (light quanta) are non-distinguish- 
able (identical particles). Then other distribution functions apply. 

a) If the particles have a half-integral spin, e.g. as in the case of electrons, then 
they obey Fermi-Dirac statistics with the distribution function 

1 

exp \^{E - 0] -f 1 ’ 



m = 



(23.44) 




23.11 The Bose-Einstein Condensation 



419 



where P — IjkT and C is the chemical potential. For T — 0, i.e. p ^ oo, f{E) 
shows the following behavior: when E < C, the exponential function in (23.44) 
tends to zero and 



m = 1. (23.45) 

When E > C, the exponential function tends to infinity and 

f{E) = 0. (23.46) 

At an arbitrary temperature T, C, is determined by the given number N of parti- 
cles via the relation 



r 

JEo 



g{E)f{E)dE = N, 



(23.47) 



where g{E) is the density of states (cf. (2.8)) and Eq is the lowest energy of the 
particles. 

b) If the particles have m integral spin (e.g. photons, some atoms and some nu- 
clei), then they obey Bose-Einstein statistics with the distribution function 



m = 



1 

exp \p{E - 0] - 1 



(23.48) 



Considering again the limit of very low temperatures, at which P ^ oo, we find 
for £■ < C that the exponential function tends to zero and 

f{E) -1; (23.49) 



this contradicts the definition of a distribution function, which cannot become 
negative. The contradiction is resolved when C is constrained to go to zero along 
with T. In order that, in the limit yS — > (X), the function f{E) remain finite, we 
must also require that E < C. We find, in accord with Einstein, that at very low 
temperatures all the particles are in the same state, which has the lowest possible 
energy; this state is called the Bose-Einstein condensate. These considerations to 
be sure make no statement about the condensation temperature, nor do they take 
the effects of interactions between the particles or of Heisenberg’s uncertainty re- 
lations into account. However: the race was on for the discovery of this con- 
densed state. 



23.11.2 The Experimental Discovery 

In the year 1995, the group of E.A. Cornell and C.E. Wiemann cooled ca. 2000 ru- 
bidium atoms to below 100 nanokelvin; they thereupon formed a Bose-Einstein con- 
densate for about 10 seconds. Let us consider this experiment step by step: 

A small amount of rubidium vapor is passed into a glass cell under ultrahigh 
vacuum conditions. Six laser beams from diode lasers meet at the centre of the cell; 
the laser frequency is chosen so that a trap is produced by the radiation pressure 
and at the same time the laser cooling effect acts on the atoms (cf. Sect. 22.6). In 
addition, the resonance of the atoms is tuned via a weak magnetic field so that they 
absorb for the most part only those photons which are travelling from outside the 
cell towards its centre. In this way, within a minute ten million atoms can be col- 
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Fig. 23.4. Bose-Einstein condensation: the measured density distribution of the cold atoms near the phase 
transition to a Bose-Einstein condensate. The image on the left was taken at a temperature slightly above 
the critical temperature 7c, the middle one just at Tq, and the right image far below Tq. Each image repre- 
sents an area of 600x600 pm. These images of the density distribution were taken after 17 ms of ballistic 
expansion of the atomic cloud following switching off of the magnetic trap (from Th. Hansch et al. 1999) 



lected within the laser trap and at the same time cooled to 40 microkelvin. Even this 
low temperature is, as was discovered, still a factor of one hundred too high for a 
condensate to form, and collisions with the laser photons prevent further cooling. 
Therefore, the lasers are now turned off; the atoms are then held in the trap by a sui- 
tably shaped, relatively strong magnetic field - they move within a magnetic poten- 
tial well. In it, they are subject to evaporation cooling, because the more energetic 
atoms can escape from the well, taking their (higher than average) energy with 
them. An important step was the construction of a narrow and deep well. 

In order to determine whether or not a Bose-Einstein condensate had actually 
formed, a shadow snapshot was taken. The tiny cloud of atoms is hardly visible to 
the unaided eye because of its small dimensions; to allow it to expand, the applied 
fields were removed: after one tenth of a second, the cell is switched off. The 
atoms, which are moving in all directions away from the centre of the trap, scatter 
some light and therefore cast a shadow, which was recorded with a video camera. 
From the pictures, the velocity distribution of the atoms and thus the temperature of 
the cloud can be determined. The velocity distribution is reproduced in Fig. 23.4, 
where the highest point corresponds to the lowest velocity. 

The condensed atoms have the lowest possible energy and therefore remain as a 
dense cluster in the centre of the trap even after the remaining atomic cloud has begun 
to expand. The snapshot shows the broadening due to Heisenberg’s uncertainty relation 
(cf. also Sects. 7.3 and 7.5). This relation appears as a macroscopic phenomenon in the 
case of the Bose-Einstein condensation. In contrast to Einstein’s original concept, 
whereby the condensate consists of particles, the wave character plays an essential 
role in this observed structure. Ketterle and coworkers have in fact shown that when 
two condensed atomic clouds overlap, an interference pattern is observed. 
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23.11.3 The Quantum Theory of the Bose-Einstein Condensation 

A detailed treatment of the quantum theory of the Bose-Einstein condensation is be- 
yond the scope of this book; furthermore, several aspects are still in a state of devel- 
opment. A few remarks are however in order. The Bose-Einstein particles are not 
distinguishable. Since they are also all in the same quantum state, they lose com- 
pletely their individuality and enter into a common coherent state. Indeed, the ana- 
logy to laser light offers itself here. Graham and Haken had already compared the 
production of laser light with the (Bose-Einstein) condensation in 1969, and Scully 
worked out more details of the analogy between the Bose-Einstein condensation and 
the nonequilibrium phase transition of laser light in 1999. 

Since the particles give up their individuality in the condensate, it is not very 
surprising that the whole condensate can be described by a single wavefunction 
which depends on only one variable. It obeys the Gross-Pitaewskii equation, which 
in the time-independent case takes the form: 

- -«) y'ir) = 0. (23.50) 

Here, m is the particle mass, V{r) is the external potential in which the condensate 
moves, and g = where a is the ^-wave scattering length which describes col- 

lisions between the particles. (A more detailed explanation: in quantum-mechanical 
scattering theory, the incoming wave is described by a plane wave and the outgoing 
wave as a superposition of etc. wavefunctions. At low energies, it is sufficient 

to take only the ^-partial waves with their corresponding radial dependence, of wave- 
length a, into account.) In (23.50), n{r) — \y/{r)\^ is the particle density and p the 
chemical potential. Equation (23.50) has the form of a Schrodinger equation, which 
however is nonlinear due to the n{r) term. Thus, (23.50) reminds us even more of 
the Hartree equation (19.59), and it is worthwhile to follow up this analogy, which 
sheds some light on the physical meaning of (23.50). The external potential V{r) in 
(23.50) corresponds to the Coulomb potential of the nucleus in (19.59), while the term 
in gn{r) in (23.50) is a parallel to (19.55). There, the Coulomb potential between the 
electrons 



47reolr - r, | ’ 

is to be replaced by an effective interaction potential between the atoms, e.g. the ru- 
bidium atoms in the experiment described above, where we can assume that all the 
atoms are in their ground state. In this way, the scattering length a enters the prob- 
lem. With further approximations, we can take the densities (/?j)|^, which are all 
the same, i.e. independent of Qj (Bose-Einstein condensation!), out of the integral in 
(19.55), yielding the factor n{r) in (23.50). Naturally our “derivation” of (23.50) is 
neither exact nor detailed, but it makes the physical content of the equation plausi- 
ble in the sense of the Hartree procedure. 
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Fig. 23.5. Atom laser: the observed density distribution of a matter-wave beam produced by an atom laser. 
The coherent atomic beam, which can be seen in the figure as a stretched-out ‘ridge’ extending towards 
the lower right, is composed of rubidium atoms and originates from a Bose-Einstein condensate. A weak 
radiofrequency field serves as a continuous and coherent mechanism for extracting the atoms from the con- 
densate; it transfers atoms from the magnetically confined condensate into non-confmed states. These atoms 
escape from the trap and are accelerated by gravity, resulting in a collimated beam of hitherto unattainable 
brilliance. The region shown in the figure has dimensions of 1.2 x 2 mm. Because of the high optical den- 
sity, the image of the condensate distribution (high peak) is saturated. (From Hansch and coworkers, 1999) 



23.12 The Atom Laser 

In Sect. 23.11, we noted the analogy between the Bose-Einstein condensation and laser 
light. The behavior of the condensate as a coherent wave becomes clear in view of the 
interference experiment of Ketterle et al. mentioned above. In 1999, Hansch, Bloch 
and Esslinger succeeded in constructing the first atom laser which was capable of con- 
tinuous operation - for 100 ms, until the condensate had been used up (Fig. 23.5). 

The principal problem here is the replenishment of the condensate to allow long- 
er operating times. The authors used a new type of magnetic trap which makes use 
of an extremely stable trap potential, with fluctuations below the 100 nT level. This 
magnetic trap, which is termed a “quadrupole Ioffe configuration”, requires only 
three magnet coils. The researchers used magneto-optically trapped atoms, 

loading them into a magnetic quadrupole and varying the field continuously in the 
parabolic geometry of the Ioffe trap. During this process, the trapped cloud of atoms 
moves towards one of the magnet coils, resulting in tight magnetic confinement. 
The trap is externally shielded, so that magnetic fields and their fluctuations from its 
surroundings are suppressed to below 10 nT. The continuous “output” can be oh- 
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tained only when the field fluctuations are held to a minimum; in particular, they 
must be much smaller than the variation of the magnetic trapping field over the di- 
ameter of the condensate cloud. To produce the condensate, the trapped atoms are 
then further cooled by a radiofrequency (rf) field-induced evaporation. In order to al- 
low atoms to be ejected from the condensate to form the laser beam, a weak rf field 
is switched on for 15 ms. To achieve continuous output coupling, the trapping field 
has to remain sufficiently stable to prevent the atomic resonance frequency from 
drifting away from the applied rf frequency. 

Just as laser light is enormously more intense than the light from normal, inco- 
herent sources, the atom laser beam is much more intense than the beams from con- 
ventional atomic-beam sources and is at the same time much more sharply focussed. 
According to Hansch, the atom laser “brightness” is at least 2 • 10^"^ atoms s“^ m“^. 
This high intensity and the strong spatial and temporal coherence of the atom laser 
beam lead us to expect important new applications. Among them are diffraction-lim- 
ited atomic beams with a diameter of less than 1 nm. New atom interferometers are 
also to be expected, and they will allow measurements of highest precision. 

Problems 

23.1 Identify |1),|2) with the spin functions | |),1 |) and with a\ ^) + P\ i). 
Which formulas result, when |1), |2) refer to linearly polarised light | J) ,1-) and 
\y/j^) to circularly polarised light? 

23.2 Show that the wavefunction 

^(l,2) = i(U)|i) + |T)IT» (.) 

can also be written as an entangled state of the wavefunctions 

0i(1)=^iU)i+^2|T)i, |J,P + |J2p = l, 

Corresponding results also hold for the wavefunction (23.14) treated in the text. 

Hint: compute 0j(l)0j(2) + 02(1)02(2) using the expressions above and derive (*) 
from it. 

23.3 Prove the relation (23.25). 

23.4 Show that the spin wavefunctions span a Hilbert space. 

Hint: use = | |) and ^2 = | i) as basis vectors. 

23.5 Convince yourself that the experiment described in Sect. 22.7 can be used as 
a gate. What roles are played by the photon and by the atom? 

23.6 Discuss the application of the experiment described in Sect. 22.7 to error cor- 
rection. 

Hint: identify the atom with the “maid” and | |) with the case of a photon present, 

I I) with the case of no photon present. 




24. Fundamentals of the Quantum Theory 
of Chemical Bonding 



24.1 Introductory Remarks 

As we know from chemistry, many atoms can combine to form particular molecules, 
e.g. chlorine and sodium atoms form NaCl molecules. But atoms of the same type can 
also form bonds, as, for example in the case of hydrogen, H2. Before the development 
of quantum theory, the explanation of chemical bonding was a puzzle to chemists and 
physicists alike. Bonding between ions, as in the negatively charged chlorine ion and 
the positively charged sodium ion, could, to be sure, be understood in the light of the 
Coulomb attraction between oppositely charged bodies; it remained, however, in- 
explicable that two similar atoms, which are electrically neutral (as, for example two 
hydrogen atoms) could form a bound state (homopolar bonding). Here it only became 
possible with the aid of quantum mechanics to attain a fundamental understanding. 
Even in the case of ionic bonding (also called heteropolar bonding), basic new insights 
have been obtained through quantum theory. For example, it must be understood why 
the ions form in the first place, and why the electron which is transferred from sodium 
to chlorine thus finds an energetically more favourable state. 

In the following, we wish to develop some important basic ideas for the quantum 
theory of chemcial bonding. It should be pointed out at the outset, however, that 
physics and chemistry are still far removed from a complete solution to these problems. 
This is a result of the fact that to understand chemical bonding, the interactions of 
several particles must in each case be taken into account: given n atomic nuclei and m 
electrons, one would have to find the complete wavefunction and the corresponding 
energies of the total system. In order to find suitable approaches to a solution in such 
problems, one must trust to intuition to a considerable extent at first, in order to 
gradually develop more systematic methods. In attacking this problem it is useful to 
keep in mind that the nuclear masses are much greater than those of the electrons. This 
fact makes it reasonable as a first step to treat the nuclear masses as though they were 
infinite. Then we may ignore the motion of the nuclei and treat them as fixed. In 
atomic physics, we were able to obtain much information from spectroscopic observa- 
tions and could direct our attention to both the ground states and to the excited states; 
in the study of chemical bonding, the determination of the wavefunction of the ground 
state of the particular molecule plays a more important role. We will therefore direct 
our attention in the following to this question, which is of basic importance for 
chemistry. Let us now turn to concrete problems. 



24.2 The Hydrogen-Molecule Ion H 2 

Certainly the simplest case of chemical bonding occurs in the hydrogen-molecule ion 
This species is observed as a bound state in gas discharges in a hydrogen 
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b 



Fig. 24.1. Overview of the hydrogen-molecule ion. The two nuclei 
are denoted as a and b, their distance apart as and give the 
distance of the electron from nucleus a and from nucleus b, respec- 
tively 



atmosphere; in such a gas discharge, the hydrogen molecule may have an electron torn 
out. The bonding energy, equivalent to the dissociation energy, has been determined to 
be 2.65 eV for H^. Here, we are dealing with two hydrogen nuclei, i.e. two protons but 
only one electron. The two nuclei may be denoted by the indices a and b (Fig. 24.1). If 
the nuclei are far removed from one another, we can imagine that the electron is 
localised on one nucleus or the other. The wavefunctions are then those of the 
hydrogen atomic ground state. In the following, we will denote the distance of the 
electron from nucleus a ox b by or rjj. If we call the wavefunction of the hydrogen 
ground state belonging to nucleus a 0a, the latter must satisfy the Schrodinger equation 






2mo ^neoKa, 






(24.1) 



^ a 

and correspondingly for the wavefunction 0^, with the energies Ea and being equal: 



E^a = El = E\ (24.2) 



If we now let the nuclei approach one another, the electron, which was, for 
example, at first attached to nucleus a, will experience the attractive Coulomb force of 
nucleus b. Conversely, an electron which was at first bound to nucleus b will experience 
the attractive Coulomb force of nucleus a. We must therefore set up a Schrodinger 
equation which contains the Coulomb potentials of both nuclei (Fig. 24.2). Further- 
more, in order to calculate the total energy of the system, we must take into account the 
Coulomb repulsion between the two nuclei. If we call the distance between these two 
nuclei the additional energy is e^/iAnE^rab)- Since this additional energy is not 
directly related to the energy of the electron, it will only produce a constant shift of all 
the energy eigenvalues. We will therefore leave this constant out of the calculations at 
first, and will reintroduce it at the end of our considerations. 

This leads us to the Schrodinger equation 









- — V \y/ = Eii/, 

2mo ^neora 4neori,/ 



(24.3) 



in which the wavefunction y/ and the corresponding energy E are yet to be calculated. 
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Fig. 24.2. Hydrogen-molecule ion. The potential energy V 
which acts on the electron due to the attractive forces from 
nuclei a and b is plotted. The dashed curves show the energy of 
an electron in the field of nucleus a or nucleus b alone. The full 
curve is the total potential energy. The abscissa is the position 
coordinate x. The binding energy of the electron in the field of a 
single nucleus, is also indicated 





Fig. 24.3. Upper portion: The wavefunction 0^ of an electron 
which is moving in the field of nucleus a and the corresponding 
wavefunction 0^ for the electron in the neighbourhood of nucleus 
b. Lower portion: If the distance between nuclei a and b is 
reduced, the two wavefunctions 0^ and 0^ begin to overlap 



We will now determine the wavefunction y/ approximately. For this purpose we will 
utilise an idea which is borrowed from perturbation theory with degeneracy (Sect. 
15.2.3). In principle, the electron could be in the neighbourhood of nucleus a or of 
nucleus b (Fig. 24.3), with the same energy in each case, as in (24.1) and (24.2). These 
two states <p„ and 0* are thus degenerate. Now however, the other nucleus, from which 
the electron is by chance more distant, acts as a perturbation to the electronic state. 
We can thus expect that the degeneracy will be lifted by this perturbation. Exactly as in 
perturbation theory in the presence of degeneracy, we form as a solution to (24.3) a 
linear combination: 



Ci0a+C206, (24.4) 

where the two coefficients Cj and C 2 are to be determined. To find their values, we 
proceed in the usual manner. We first insert the trial solution (24.4) into (24.3) 
and obtain 






2mo 



Ansora 



C\ 0a 






+ ( -— v^- 

\ 2mo 



47reor* 4;^^o/•a, 



= £'(^1 <t>a+^2(t>b) 



Jtu 



(24.5) 



We have collected the terms in the two brackets in (24.5) so that the operator 
operates on 0^ and on 0*. We can simplify these expressions immediately with the 
help of (24.1) and the corresponding equation for by writing £’a0a in place of 
J^a0a, for example. 
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If we now bring the right-hand side of (24.5) to the left side, we obtain 
[E^-E- ) Cl 0,+ Ie^-E- — ^ ) C20* = 0 . (24.6) 

V' — . — ' ^nsori,/ \ , — ' ^neoKa/ 

-AE -AE 



While <t)a ^nd 01 , are functions of position, the coefficients Ci and C 2 are independent of 
position. To find a position-independent equation for the c’s, we multiply (24.6) with 
01 or 0 ^, as we are accustomed from perturbation theory, and integrate over the elec- 
tronic coordinates. For this purpose we assume in the following that the functions 0 „ 
and 01 , are real, as is the case for the hydrogen-atom ground state wavefunctions. We 
must take the fact into account that the functions 0 ^ and 0 i, are not orthogonal, i.e. 
that the integral 

\<t>a0bdV=S (24.7) 

does not vanish. If we multiply (24.6) by 0 „ and then integrate over electronic 
coordinates, we obtain expressions which have the form of matrix elements, namely the 
integrals 



) 0 „(r,)rfK=C, (24.8) 

\AneorhJ 

\(t>a{ra)i—^^-A Mrb)dV = D . (24.9) 

\4neora/ 

which we have abbreviated with the letters C and D. The meaning of the first integral 
becomes immediately clear if we recall that - e 1 0 „ stands for the charge density of the 
electron. Expression (24.8) is then nothing other than the Coulomb interaction energy 
between the electronic charge density and the nuclear charge e of nucleus b (Fig. 24.4). 
In the integral (24.9), by contrast, instead of the electronic charge density, the expres- 
sion - c 00 01 , occurs. This means that the electron in some sense is partly in state a and 
partly in state b, or, in other words, an exchange between these two states takes place. 
For this reason, 0 o 0 * is referred to as the exchange density, and integrals in which such 
exchange densities 0 o 0 i, occur as exchange integrals (Fig. 24.5). These integrals 
represent a particular quantum mechanical effect. 

If we multiply (24.6) by 0 i, instead of by 0o and integrate, we obtain expressions 
which are quite similar to (24.8) and (24.9), except that the indices a and b are ex- 
changed. Since, however, the problem is fully symmetric with respect to the indices a 
and b, the new integrals must have the same values as the old. If we collect all the terms 
resulting from multiplying by 0 o and integrating, (24.6) becomes 

i-AE+C)c^y{-AE-S + D)c 2 = 0, (24.10) 

and correspondingly we obtain after multiplying by 0 * and integrating: 



{-AE- S+D)c^ + {-AE+C)C2 = 0. 



(24.11) 
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Fig. 24.4. Intuitive explanation of the integral (24.8). This 
integral represents the Coulomb interaction energy of an elec- 
tron density cloud with the probability density distribution 0^ 
in the Coulomb field of the nucleus. The density distribution is 
shown as a shaded area, while the full curve gives the potential 
energy of a point charge in the Coulomb field of nucleus To 
calculate the integral, at each point in space the value of 0^ 
must be multiplied with the corresponding value of 
-e^/(4 7T£oA’^) and the products must be added (integrated) 
over the entire volume 




Fig. 24.5. Intuitive explanation of the integral (24.9). The three 
functions 0^, 0^, and -eV(47r£o^^,) which occur in the integral 
are plotted. Since the product of these three functions occurs, 
nonvanishing contributions are only possible when the wavefunc- 
tions 0Q and 0^ overlap, as is indicated by the heavily shaded 
area. The integral is obtained by multiplying the functional values 
of 0^, 0^, and -eV(4 7r£oA*^) at each point in space and adding 
up over the whole volume 



These are two quite simple algebraic equations for the unknown coefficients and C 2 . 
In order that these equations have a nontrivial solution, the determinant must vanish, 
that is 

(-AE+Cf-(-AE- S + Df = 0. (24.12) 

This is a quadratic equation for the energy shift AE, which we can solve immediately: 

E = E^+AE = E^+ , (24.13) 

1±S 

where the two possible signs are due to the square root entering the solution of (24. 12). 
If we insert AE in (24.10), for example, we see that the upper sign corresponds to a 
solution with 

C 2 = -q = - c . (24.14) 

In this case, we obtain for the total wavefunction 



y^ = c{(l)a-(l)b) • (24.15) 

The constant c still has to be determined through the normalisation of the total wave- 
function. This wavefunction is represented in Fig. 24.6. If we use the lower sign in 
(24. 1 3), we obtain the coefficients C 2 = Cj = c and, with these, the total wavefunction 
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a 




Fig. 24.6. a) The symmetric wavefunction is formed by superposing the wave- 
functions 0a and 0ij . Because of the overlap between 0^ and 0^ , the occupation 
probability for \j/^ between the two nuclei is increased, b) Illustration of the 
density distribution of the electron in the state 





Fig. 24.7. a) The antisymmetric wavefunction is formed by taking the difference of 0^ and 0^. The occu- 
pation probability is clearly zero in the plane of symmetry between the two nuclei, b) Illustration of the 
density distribution of the electron in the hydrogen-molecule ion in the case of the antisymmetric wavefunc- 
tion \i/_ 



W = c{(t>a+ (!>b) (24.16) 

(Fig. 24.7). 

As may be seen in Figs. 24.4 and 24.5, the quantities S, C, and D depend on the 
internuclear distance rgi,. If the two nuclei approach one another, the electronic energy 
E splits up in a way depending on whether we are dealing with a symmetric state(24. 16) 
or with an antisymmetric state (24.15). In the symmetric case the energy is reduced; 
here, we may speak of a bonding state or bonding orbital. In the case of the antisym- 
metric wavefunction, the energy is increased and we may speak of an antibonding state 
(Fig. 24.8). 

In order to determine whether the hydrogen-molecule ion is, in fact, bound, we 
must add the Coulomb repulsion energy between the two nuclei a and b to the elec- 
tronic energy E. We then find for the bonding state that the total energy depends on the 
internuclear distance as shown in Fig. 24.8. One can recognise from this figure that 
there is a certain internuclear distance at which the total energy of the system has a 
minimum; thus, a bound state exists. 

As may be seen in Fig. 24.6, in the bound state the probability density of the 
electron is relatively large between the two nuclei. The electron can thus, from an 
energetic point of view, profit from the Coulomb attraction of both nuclei, which 
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Fig. 24.8. The energy E of the hydrogen- 
molecule ion, taking the Coulombic re- 
pulsion between the two nuclei into con- 
sideration. The energy curves are plotted 
to show their dependence on the inter- 
nuclear distance for the bonding and 
the antibonding states 



lowers the potential energy of the total system. In the antibonding state (Fig. 24.6), the 
probability density of the electron between the two nuclei is small, in the middle in fact 
0, which means that the electron only experiences the attractive force of one nucleus at 
a time. 



24.3 The Tlinnel Effect^ 

Using the example of the hydrogen-molecule ion, we may get an excellent description 
of an effect which is known as the “tunnel effect” and which represents a typically 
quantum mechanical phenomenon. Let us first consider the behaviour of a classical 
particle in a potential field, shown in Fig. 24.2. If the particle is first in the region of the 
left potential well with a total energy as represented by the horizontal line in the figure, 
it cannot pass the potential barrier between the left and right regions; it remains forever 
trapped in the left portion of the potential. We will now show that this is not true in 
quantum theory: a particle which is originally in the left-hand region of the potential 
can pass over into right-hand region after a time. 

In order to simplify the discussion somewhat, we will assume that the overlap 
integral S is small (S 1) and can be neglected. The energy of the electron can then be 
given in the form 

E = E^+C±D. (24.17) 

We consider the solutions of the time-dependent Schrodinger equation, which in 
general have the form 

y/(r,t) = , (24.18) 

For \j/{r) we will use the approximate wavefunctions (24.15, 16), and for E the corre- 
sponding energies (24.17). To simplify the notation, we introduce instead of the 
energies the frequencies, related hy E= hoj\ 



1 



The reader who is interested only in chemical bonding may wish to skip this section 
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(E^+C)/h=ojQ, 
D/h = coj’/2, . 
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(24.19) 

(24.20) 



In (24.20) the factor 1/2 and the subscript T were introduced for a good reason, as we 
will shortly see. According to the different solutions (24.15) and (24.16) we obtain the 
solutions of the time-dependent Schrodinger equation in the form 



K2 


(24.21) 


1 


(24.22) 



From such time-dependent solutions we can construct so-called wave packets by 
superposition of the two functions (24.21) and (24.22): 



ij/ir, t) = + di y/- , (24.23) 

where the coefficients d\ and di are time-dependent constants. By inserting (24.23) into 
the time-dependent Schrodinger equation 

= (24.24) 

where the Hamiltonian ^ is the same as in the Schrodinger equation (24.3), it is 
possible to convince oneself that the trial solution (24.23) is, in fact, an approximate 
solution to (24.24). We now choose d^ and d 2 in such a way that at the initial time ^ = 0, 
the electron is in state 0^ on atom a; wq do this by setting di = d2 and choosing both 
equal to 1/1/2 for reasons of normalisation. Then the wavefunction is given by 

y/{r,t) = y[(0^+0Z7) + (0a-06)exp(-icorO] exp(-icoo^ + icur^/2) . (24.25) 

^y/- 

If we allow the time t to vary, the relative phase between the states y/^ and y/_ changes, 
as can be seen from (24.25). 

We now recall a relation from mathematics, namely 

exp(ia>7’0 = cosct^T’^+i sincuj’/ . 

If we choose a time t = tQ such that corto = n, the cosine will be equal to - 1 and the 
sine to zero. For this to we obtain instead of (24.25) 

w(r, 0 = T [(0a + <pb) -(<Pa- 0b)] exp ( - i Wo ^ 0 + i Wt’^/2) 

= 0ftexp( — io»o^+ia»r^o/2) . 

The electron is thus now in the state 0^ on atom bl It has tunnelled through the 
potential barrier. After the twofold time 2 to it has tunnelled back to atom a. The 
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circular frequency for the tunnelling process is thus given hyln/ltQ, which is just the 
frequency introduced in (24.20) as coj. According to (24.20), cu^is, aside from con- 
stant factors, given by the exchange integral (24.9). The larger the overlap between the 
wavefunctions 0^ and 0^, the more rapidly the electron tunnels back and forth. 

The tunnel effect plays an important role in many areas of modern physics. An 
especially well-known example is the a decay of atomic nuclei, where the a particle is 
able to overcome the attractive nuclear potential barrier by means of tunnelling. 



24.4 The Hydrogen Molecule H 2 

We will now turn to the problem of chemical bonding when more than one electron is 
involved. Before we take up the simplest concrete example, that of the hydrogen 
molecule, we will make a general remark which is of fundamental importance for many 
problems in quantum theory. 

We meet up again and again with the task of solving a Schrodinger equation of the 
general type 



= (24.26) 

which is however often not possible in closed form. As well as the methods of solution 
which we have treated up to now, for example perturbation theory, there is another, 
fundamentally different method, which is based on a variational principle. 

To explain this method, let us imagine the Schrodinger equation (24.26) to be multi- 
plied by I//* and integrated over all coordinates on which j// depends. We then obtain 

E = 1 ^ W • • - dVfi ^ 2 ^ 27 ^ 

Since the Hamiltonian is the operator for the total energy, we have here nothing 
other than the expectation value for the energy, which in the present case is the same as 
the energy eigenvalue of the Schrodinger equation. What will happen now, if instead of 
the true solution of the Schrodinger equation, we substitute another function for y/l 
Then expression (24.27) still has the dimensions of an energy, but it must no longer 
represent a correct eigenvalue of the Schrodinger equation, which is what we are 
seeking. Mathematically, one can now prove an extremely important relation: if we in 
fact use some function other than the correct eigenfunction ij/ for the ground state, the 
energy expectation values which we obtain will always be higher than the eigenvalues of 
the solution to (24.26). In this sense, we can define a criterion for the quality of our 
approximate wavefunctions: they are better and better, the lower the corresponding 
energy expectation values. 

We shall use this criterion repeatedly in later sections. Now we will return to the 
problem of determining approximately the wavefunction and the energy of the 
hydrogen molecule in the ground state. 

The two atomic nuclei (protons) will be denoted by the indices a and 6, the electrons 
by 1 and 2. Since the Coulomb force acts between all the various particles, we must 
introduce the corresponding distances, which are defined in Fig. 24.9. In order to write 
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Fig. 24.9. Overview of the hydrogen molecule. The two nuclei 
are denoted by indices a and b, the two electrons by indices 1 
and 2. The distances between the nuclei, between the electrons, 
and between electrons and nuclei, and the notations used for 
them, are shown in the sketch 



the Hamiltonian, we recall again the energy balance of classical physics: we are con- 
cerned here with the kinetic energies of electrons 1 and 2, as well as with the various 
contributions of the Coulomb energy. If we express the kinetic energies as quantum 
mechanical operators, we obtain for the Hamiltonian the expression: 



je=~- 



Vf- 



2m 



47tSorai 2m 



VI- 



4 near 1,2 



Jl\ 



Jir 



4neori,i 



+ + 

47t£o/'a2 4nsorai, 4neori2 



(24.28) 



V] and V 2 are Laplace operators (19.3) which refer to the coordinates of electrons 1 
and 2, respectively. We once again assume that the nuclei are infinitely massive. 

We have the task of solving the SchrOdinger equation 



Jfy/(ri,r2) = Ew(i'ur2) (24.29) 

with the Hamiltonian (24.28). If the nuclei were infinitely far apart, it would suffice to 
treat each atom independently, i.e. to solve the equations 



2 oti 



•V?- 



4neor„i 



and 



^ v|. 



2m 



4?teof62; 



<Pa(ri) = Eo0a(ri) 



0b(r2) = Eo0b(r2) 



(24.30) 



(24.31) 



Since we are concerned here with a two-electron problem, we must take the Pauli prin- 
ciple into consideration (Sect. 17.2), that is we must take special account of the fact 
that the electrons possess spins. If the two hydrogen atoms did not interact with one 
another, we could immediately write down the total wavefunction with the help of the 
two wavefunctions and which occur in (24.30) and (24.31). As may be seen by 
substituting into the Schrddinger equation with Jf = + jf 2 > 
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(24.32) 

would be a solution. 

To take account of the spins of the electrons, we must multiply this solution with 
spin functions. The reader who is not familiar with the spin formalism need not despair 
at this point, since we will use only a very few properties of the spin functions and then 
can ignore them in the further course of the calculation. 

We denote the function which represents an electron with its spin up by a; a spin 
wavefunction of this type was called in Sect. 14.2.2. If the electron in question is 
electron 1, we will call the spin function a(l). If both spins point in the same direction 
(upwards), our wavefunction is then given by 

d>a(ri)0b(r2)a(\)a(2), (24.33) 

This, however, does not satisfy the Pauli Principle, which in its mathematical interpre- 
tation requires that a wavefunction be antisymmetric in all the coordinates of the 
electrons, or, in other words, that the wavefunction reverse its sign when we exchange 
the indices 1 and 2 everywhere. The wavefunction (24.33) does not have this property. 
By contrast, the wavefunction 

W = 0fl(^i) «(f ) 0b(^2) «(2) - 0a(^2) «(2) 0^(/*i) «(1) (24.34) 

does. If we factor out a(i) and a(2), this wavefunction is reduced to the simple form 
y/ = a(l)a(2)[0a(/-i)06(r2)- 0a(r2)06(''i)] . (24.35) 

' V ' 



that is a product of a spin function and a function which depends only on spatial coor- 
dinates. [In quantum theory, wavefunctions which are symmetric with respect to 
exchange of the electronic spatial coordinates are denoted as “even” (symbol “g”, from 
the German word “gerade”), while those which are antisymmetric are called “odd” 
(“u”, from the German word “ungerade”)]. 

As we know from the study of atoms, the energetically lowest states, of the lighter 
atoms at least, are constructed in a many-electron atom by filling the states from the 
lowest upwards with electrons of antiparallel spins. We therefore expect, and the expec- 
tation is confirmed by calculation, that the wavefunction \j/ in (24.35) does not 
represent the energetically lowest state, since here the spins are parallel. We must 
instead consider a wavefunction in which the spins are antiparallel, in which one 
electron is described by a “spin up” function a and the other by a “spin down” func- 
tion p. Here there are a number of various possibilities obtained by combining expres- 
sions of the form (24.33). One such possibility would be 

(!>air\) 0fc(''2) a(l))8(2) . (24.36) 

Other trial functions can be obtained from (24.36) by exchanging the coordinates 
and /*2 or the arguments of a and yff, i.e. 1 and 2, or by exchanging all the coordinates 
with one another. None of these combinations is itself antisymmetric. We shall try to 
find a combination of (24.36) and similar trial functions of the sorts just mentioned 
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which will be antisymmetric and permit the factorisation of the wavefunction into a 
spin part and a spatial part, as in (24.35). This is, in fact (as one discovers after a bit of 
trial and error), possible, and one finds a trial function 

[0a(/*i)0z.(^2) + 0 .(^ 2 ) 0^(^i)][a(l))8(2) . (24.'37) 

V / 

J//g 

The spin function is here visibly antisymmetric, while the spatial part is symmetric. If 
we exchange all the spatial and spin coordinates of the two electrons simultaneously, 
we find in agreement with the Pauli principle an antisymmetric wavefunction. The spin 
functions were just an aid to help us take the overall symmetry of the wavefunction into 
account. Since, however, no operators occur in the Hamiltonian (24.28) which in any 
way act on the spin, we can completely neglect these spin functions in our following cal- 
culations of the energy. This means that in the approximation used here, the interaction 
of the spins with each other (spin-spin interaction) and the interaction of the spins with 
the spatial functions (spin-orbit interaction) are not taken into account. We will 
concern ourselves from now on only with the spatial wavefunctions and ^g. 

Following the basic suggestion of Heitler and London, we imagine these wavefunc- 
tions and yj^ as approximate solutions of the Schrddinger equation with the Hamil- 
tonian (24.28), which contains all the Coulomb interactions between electrons and 
protons, and assume that using them, the exact energy is approached with the aid of 
(24.27). We thus have the task of calculating the energy expectation value for these 
wavefunctions. 

As a first step in the calculation of the expectation value, let us consider the normal- 
isation integral which occurs in the denominator of (24.27). It has the form 

\\Wiruri)\^dVidV2 

= \\[<Pair\)(l)bir2)±(l)a{r2)(pb(ri)] 

X [(t>a(ri)(t>b(r2)±(l>a{r2)<Pbiri)\dV,dV2, (24.38) 

where we have assumed 0^, 0^ real. 

After multiplying out the terms we obtain 

j 1 0 a f-d V,\\<PbUV2+\\0afdV2\\0bUV, 

± 1 0air\ ) 0b{r\)dV^ I (f>a(r2) 0bir2)dV2 

±\0a(r2)0b(r2)dV2\0b(r,)(l)airx)dV, . (24.39) 

The first two expressions can be reduced because of the normalisation of the wavefunc- 
tions 0a ^nd 0^ to 



\\0aUv, = \\(DbUV2=\, (24.40) 

while the remaining terms are squares of the overlap integral 



\0a^r\)0bir\)dVx = S. 



(24.41) 
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With these results, we can write the normalisation integral (24.38) in the simple form 

2(1 ±S^). (24.42) 

In evaluating the numerator of the energy expeetation value (24.27), we encounter 
similar expressions, of which we give two examples here. 

We begin with the expression 



H </»«('•! )06(''2) 



4;ieor*i 



+ ■ 



+ ■ 



4;reo/'«2 47reo/-a* 47reori2 






X 0a(/-l ) 06(^2) 



(24.43) 



Since the Hamiltonian jfj in (24.43) operates only on 0^, we can use the fact that (pa is a 
solution of the Schrodinger equation (24.30) for the further evaluation of (24.43). 
Evaluating the terms with 3^2 in similar fashion, we arrive at the expression 



ni<?>a(''i)h06('‘2)n 2 Eo- 



(1) 



47r£o''61 



4;T£o/-„2 
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Ane^rab 
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\dVidV2. 

(24.44) 



It is useful in the following to investigate the meanings of the individual terms (1 - 5) 
separately. 

1) 2 £o is the energy of the infinitely separated hydrogen atoms. (24.45) 

2) fl0a(/-i)n )dV, = A<0. (24.46) 

\ 4 7T£o /•<,,/ 



This expression represents the Coulomb interaction energy of electron 1 in state a with 
nucleus b. 



3) ||(06(/-2)|' 



-e 



2 



4neora2 



dV2 = A<0. 



(24.47) 



This integral represents the Coulomb interaction energy of electron 2 in state b with 
nucleus a. From the symmetry of the problem it follows that the two integrals (2) and 
(3) are equal. 

4) Because of the normalisation of the wavefunctions and 0*, we obtain here 

1 . (24.48) 

Aneorab 



This is the energy of Coulomb repulsion of the two nuclei. 



5) J||0fl(£l)Pl06(''2)P 



e 



2 



47T£o^12 



dV,dV2 = E,^,, 



(24.49) 
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This integral represents the repulsive Coulomb energy between the two electron 
clouds. 

Summarising the contributions (24.45 - 49), we find a term from the energy expec- 
tation value of (24.43): 

E/2 = 2Eq+2A+ ^int + ^ . (24.50) 

^UE^rab 



This is, however, still not the total energy expectation value, because on substituting y/^ 
and y/^ into (24.27), we find the occurrence also of exchange terms of the type 

H06(Tl)0a(T2)(. • •)06(''2)0a(Tl)rfKi dVi (24.51) 

in which the parenthetical expression (. . .) is the same as that in (24.43). Explicitly, 
(24.51) is given by 

± 1 1 <t>b(r\) (j)a(r2) 0a(ri) 0i,(/-2) 

/ ^ \ 

X 2£’o- -I- 4- ]dV^dV 2 . (24.52) 

\ ^nsorbi ^nsoraj ^nEorab ‘^nEornJ 

^ '"”(2p 

The terms have the following forms and meanings: 

1) ±2EoS^ (24.53) 

is the energy of the two separated hydrogen atoms multiplied by the square of the 
overlap integral S, which we have already met in (24.7). 

2) ±\d>a(r2)d>b(r2)dV2\0birx) ( - — ) 0a^r,)dV, . (24.54) 

V 47reor6i/ 



D 

This exchange integral is a product of the overlap integral S with the single electron 
exchange integral D (24.9). 

3) ±SD. (24.55) 

The result is the same as in (24.54), except that the indices of the electrons and the 
atoms are interchanged. 

4) ±S ^ — ? . (24.56) 

Ane^rab 



The square of the overlap integral is multiplied by the energy of the Coulomb inter- 
action between the two nuclei. 
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5) ±|j0,(ri)0a(r2)— ^ $a{r^)<Pbir2)dV,dV2= ±Ec^. (24.57) 

4 71 eon'll 



This integral represents the Coulomb interaction between the two electrons, in which 
however the normal charge density does not occur, but, instead, the exchange density. 
It is therefore referred to as the Coulombic exchange interaction. 

The total contribution resulting from (24.53 - 57) is thus 

2 

E/2= ±2EqS^±2DS±Ece±S'^ . (24.58) 

47reo/-«* 



At this point we recall our original goal, which was to calculate the numerator of 
(24.27) using the wavefunctions and y/^. If we multiply out all the terms in y/^ and 
l//g, we obtain integrals of the type (24.43) twice, and integrals of the type (24.51) twice. 
Finally, we must divide the result by the normalisation integral; we then obtain the total 
energy of the hydrogen molecule: 



p = 



E+E 
2(1 ±S^) ’ 



(24.59) 



in which, for the wavefunction y/^ or 1//^ the upper or the lower sign is to be chosen: 



Eg — 2 Eq + 



2 A +£'int 
1 + S^ 



2DS-^Eq^ 

1 + S^ 47r£o^aZ? 



= 2Eq + 



2Aa-^[^ 

1-S^ 



2DS-\~Eq£ ^ 6^ 

1-S^ 4neorab 



(24.60) 

(24.61) 



In order to determine whether the molecule is chemically bound or not, the integrals 
in (24.60) and (24.61) must be numerically evaluated, since different effects are in com- 
petition and it is not immediately clear which of them is predominant. We have partly 
negative and partly positive contributions from the Coulomb interactions of the elec- 
trons between themselves, the nuclei between themselves, and the electrons with the 
nuclei. In addition, typically quantum mechanical effects occur, such as exchange 
interactions, which may be combined in the expression 

K = 2DS + Ece<0. (24.62) 

It should not be our goal here to carry out the numerical evaluation of the integrals. 
Such an evaluation shows that the exchange contributions, for example (24.62), are 
negative. This produces the effect that the even wavefunctions are lower in energy than 
the odd wavefunctions. Furthermore, it is found that the net effect of the various 
Coulomb interactions is to lower the total energy of the hydrogen molecule relative to 
that of two free hydrogen atoms; this lowering, together with the exchange effects 
(24.62), comes about because the electrons can both be between the two nuclei simul- 
taneously. They thus can profit from the attractive Coulombic potentials of both 
nuclei, and in such a way, that the Coulombic repulsion of the electrons for each other 
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Fig. 24.10. The bonding energy of the hydrogen molecule as a 
function of the internuclear distance taking the repulsive 
Coulomb energy between the nuclei into account. Upper curve: 
The electronic spins are parallel. Lower curve: The electronic 
spins are antiparallel 



and of the nuclei for each other are to some extent compensated. The net energy 
lowering depends on the internuclear distance r^b. 

At very small distances, the positive repulsive energy of the two nuclei finally 
becomes dominant. Therefore, the total energy exhibits a minimum at a certain value 
of Lab (Fig. 24.10). The dissociation energy, which is given by the difference between 
the minimum energy at the equilibrium distance and the energy at a distance = oo, 
is, according to the calculation, 3.14 eV. The observed bonding energy, which is equal 
to the dissociation energy, is, by contrast, 4.48 eV, whereby it must still be taken into 
account that the nuclei contribute some energy through their kinetic energy. If this con- 
tribution, whose value we will not calculate here, is subtracted, the bonding energy 
would become even larger, 4.75 eV. We thus see that a considerable discrepancy exists 
between the calculated and the measured bonding energies. This means that the wave- 
functions of the Heitler-London model are still relatively inaccurate. While they show 
us that the hydrogen molecule is, in fact, bound, they can give the shape of the wave- 
functions only very approximately. To improve these wavefunctions several other 
effects must be taken into account more precisely; we will discuss only one typical 
example, which is known as covalent-ionic resonance. 



24.5 Covalent-Ionic Resonance 

In the previous section we employed a wavefunction for the two electrons in which the 
one electron is always in the neighbourhood of the opposite nucleus to the other elec- 
tron. In this case, known as “covalent bonding”, the wavefunction was of the type 

¥cov = N[(l)a{r^)(l)b{r2) + (pair 2 ) 0bir\)] , (24.63) 

where A is a normalisation factor. 

It is of course possible that - at least with a certain probability - one electron is 
localised at the same time as the other on the same atom, in which case the wavefunc- 
tion would be of the type 



0air\)0airi) • 



(24.64) 
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Since the two atomic nuclei are equivalent, both electrons could just as well be near 
atom b, which would lead to the wavefunction 

4>b{ri)(l>b{r2 ) . (24.65) 

The functions (24.64, 65) describe states in which a negatively charged hydrogen atom 
is present; for this reason, they are called '‘ionic'’. Since the states (24.64, 65) are 
energetically degenerate, we must form a linear combination from them, which we will 
do in the symmetric form: 

Wion = {0a{r\) 0a{ri) + $b{r\) 0b{ri)] (24.66) 

so that (24.66) has the same symmetry as (24.63). Now we must expect that in nature 
neither wavefunction (24.63) nor wavefunction (24.66) occurs uniquely, since the elec- 
trons avoid each other for part of the time, but can still be in the neighbourhood of the 
same nucleus at other times. Since both possibilities exist, we must - according to the 
basic rules of quantum mechanics - construct the wavefunction which best fits the ob- 
servations by taking a linear combination of (24.63) and (24.66): 

W~ W ^ W ion •> (24.67) 

where the constant c is still a parameter which must be determined by using the condi- 
tion that the energy expectation value belonging to (24.67) be minimised. 



24,6 The Himd-Melliken-Bloch Theory of Bonding in Hydrogen 

As well as the Heitler-London method, which we have discussed above, a second tech- 
nique is often used in molecular physics; to be sure, it does not yield such good results for 
the bond energies as the Heitler-London method, but it allows the wavefunction of the 
individual electrons to be more closely described. This is of particular interest in molec- 
ular spectroscopy, where in general only a single-electron state is changed by an observed 
transition and one would like to treat just this change theoretically. 

In this method, the fact that two electrons are involved in the problem is at first 
ignored. We consider instead the motion of a single electron in the field of the atomic 
nuclei, or, in other words, we start from the solution to the problem of the hydrogen- 
molecule ion. This solution was derived in Sect. 24.2; the result had the form 

Wlr) = N{0a{r) + 0b{r)]- (24.68) 

The idea is now to insert both of the electrons of the hydrogen molecule one after 
the other into this state (24.68). As a trial solution of the Schrodinger equation with the 
Hamiltonian (24.28) for both electrons we therefore use 

Wiruri) = Wg{r\) Wgi^i) * spin function , (24.69) 

in which we will restrict ourselves to the case that the spins are antiparallel; the spin 
function is antisymmetric, and thus has the form 
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spin function = [a(l) JS(2) - a(2) /S(l)] . (24.70) 

1/2 

The total wavefunction (24.69) .may be seen to be antisymmetric with respect to the 
electronic space and spin coordinates. With the trial solution (24.69), we can again cal- 
culate the expectation value of the total energy; the result is not quite as accurate as that 
of the Heitler-London method. The procedure which we are describing is called the 
‘‘Linear Combination of Atomic Orbitals'' method, abbreviated LCAO. A linear com- 
bination such as that given in (24.68) represents the wavefunction of a single electron in 
the molecule and is also referred to, for this reason, as a molecular orbital 

This technique may be extended to other more complicated molecules, in which case 
the wavefunction is first constructed for each pair of centers (atomic nuclei) and then 
filled with a pair of electrons. The method requires some modifications for many 
molecules, some of the most important and widely applied of which we will describe in 
the following. 



24.7 Hybridisation 

A particularly important case, which is of special interest in organic chemistry, is 
hybridisation. In treating it, we will be led to atoms with several electrons. In the 
formation of chemical bonds and molecules, the electrons of the closed inner shells are 
not strongly affected. Chemical bonding takes place through the outer electrons 
(valence electrons), which are more loosely bound to the nuclei. In the case of the 
carbon atom, of the six electrons, two are in the state, two in the 2s state, and the 
remaining two in two of the three states 2py, and Ip^ (Sect. 19.2). The degeneracy 
of the state with principal quantum number n = 2, which is familiar from the hydrogen 
atom, is lifted here; to be sure, the energy difference between the 2s and the 2p states, 
4 eV, is not too large, and there is in fact an excited state of the carbon atom in which 
an electron is excited out of the 2s state into the 2p level (Sect. 19.2). Then the states 
2s, 2p^, 2py, and 2p^ are all occupied. Let us now consider these states, each of which 
contains a single electron, more carefully, while letting external forces act on the 
electrons, for example by bringing a hydrogen atom into the neighbourhood of the 
carbon atom. Then the energy splitting which still exists between the 2s and the 2p 
states may be more or less compensated by the external forces, so that the 25 and 2p 
states are practically degenerate. 

As we know from perturbation theory in the presence of degeneracy, we must, in 
such a situation, form linear combinations of the old, degenerate functions. For 
example, instead of the 2s and the 2p^ functions, we can construct two new wavefunc- 
tions of the form 



\l/y - (Ps~^0p^y 
- ~ 0s~ • 



(24.71) 



As we have shown earlier, linear combinations of this type mean that the centre of 
charge is shifted as compared to the s function (Fig. 24.11). Just such a phenomenon 
occurs in hybridisation. Let us discuss some types of hybridisation, beginning with the 
most well-known case, that of methane, CH4, where the carbon atom is surrounded by 
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Fig. 24.12. Left: The density 
distributions of the four elec- 
trons in the case of tetragonal 
hybridisation of carbon. 
Right: Exploded view 




Fig, 24.11. Shape of the wavefunctions in diagonal hybridisation {sp hybridisation). The 5 function is shown 
as a dashed line, the p function as a dot-dashed line, and the function resulting from their superposition as a 
full curve. The figure shows clearly how the centre of gravity is displaced to the right due to the superposition 
of the wavefunctions 0^ and 0p 



four hydrogen atoms. It is known experimentally that the carbon atom sits in the centre 
of a tetrahedron whose vertices are occupied by the hydrogen atoms. Interestingly, the 
four degenerate wavefunctions with principal quantum number /? = 2 in the carbon 
atom can form four linear combinations, whose centres of charge are displaced in the 
directions of these four vertices. If we now recall the fact that the wavefunctions of the 
p state have the form f(r)x,f{r)y, and /(r)z, we can see in elementary fashion that the 
following linear combinations generate the charge displacements mentioned above: 

y/x = y(05+ <l)p^+ (j>p^ , 

W2 ” y(05+ 4^p ~ * 

" ^ 24 . 72 ) 

\j/^ = + 0Py~ > 

^4 = y(05"” 0py'^ ‘ 



With these new linear combinations, the electrons of the carbon atom are “adjusted” to 
the tetrahedral environment. Each one of the four wavefunctions (24.72) can take part 
in forming a bond with the corresponding hydrogen atom (Fig. 24.12). For example, 
we select the direction of vertex 1 and denote the wavefunction xf/^ of (24.72) in the 
carbon atom more exactly as that in the hydrogen atom as Similarly to 

the hydrogen molecule, according to the LCAO method, a wavefunction for each of 
the electrons which participates in the bonding can be written in the form 

¥ir) = y/c\ ir) + «0H1 ('■) • (24.73) 

Since the carbon atom and the hydrogen atom are different, the coefficient a will not 
be equal to 1 (in contrast to the hydrogen molecule) and must be determined by a varia- 
tional method. 

In the present case, we have guided our considerations by the experimental evidence, 
which indicates that four hydrogen atoms are located at the vertices of a tetrahedron. 
It is tempting to ask whether, in a theoretical treatment of this problem, the wavefunc- 
tions (24.72) are to be regarded as given in advance, and the hydrogen atoms then 
attach themselves to the vertices of the (pre-existing) tetrahedron, or whether the 
hydrogen atoms first place themselves on the vertices of a tetrahedron and the carbon 
wavefunctions then adjust themselves accordingly. From the viewpoint of quantum 
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Fig. 24.13. Left: The density distri- 
butions of the three electrons in the 
case of trigonal {sp^-) hybridisation 
of carbon. Right: Exploded view 



mechanics, such considerations are superfluous. The position of the hydrogen atoms and 
the orientation of the wavefunctions mutually influence each other. The overall con- 
figuration is chosen by the CH 4 molecule in such a way as to minimise the total energy. 

The tetragonal hybridisation discussed above, i.e. arrangement of the wavefunc- 
tions in tetrahedral symmetry, is not the only possible form of hybridisation in the 
carbon atom. A further form has already been anticipated, namely diagonal hybridisa- 
tion, which is expressed in the wavefunctions (24.71) (Fig. 24.11). 

In carbon, a third type of hybridisation is possible, the trigonal form, in which the 
wavefunctions of 5 -, Py.- and Py types generate preferred orientations in three directions 
within a plane by forming suitable linear combinations. In order to give the reader an 
impression of the form which such linear combinations take, we will give them 
explicitly here (Fig. 24.13): 




Wi = v\(<t>s+ V\<l>Py-V\<l>p) > (24.74) 

= V\i(l>s-V\<l>Py- V\(!>p) ■ 

Clearly, the original fourth wavefunction, p^, is not used in these three wavefunctions. It 
plays an additional role in chemical bonding, as we will see: we consider the case of 
ethene, C 2 H 4 . Here the two carbon atoms each form a trigonal configuration with two 
hydrogen atoms. The hydrogen-carbon bonds are given by wavefunctions of the type 
(24.73), where, e.g. for y/Qx we now take y /2 from (24.74). A carbon-carbon bond is form- 
ed by the first of these wavefunctions, with each carbon atom contributing one electron. 
The electrons in the p^ orbitals remain unused; these orbitals can be employed to form 
linear combinations following the Hund-Mulliken-Bloch method in analogy to the hy- 
drogen molecule, leading to an additional bond between the two carbon atoms. We thus 
finally see that we are dealing with a double carbon-carbon bond (Fig. 24.14). 




Fig. 24.14. Density distribution of the hybridised electrons of carbon in ethene 
C 2 H 4 . Left: The two carbon atoms are located at the two nodes and form, together 
with their respective bound hydrogen atoms, a trigonal configuration. Right: The 
functions of the two carbon atoms, which are oriented perpendicularly to the 
plane of the trigonal bonds, form a second carbon-carbon bond 



24.8 The n Electrons of Benzene, €^£[5 

To conclude our brief excursion into the quantum mechanics of chemical bonding, we 
will treat another typical case, that of benzene, CsHg. As is experimentally known, this 
molecule is planar. The hydrogen atoms lie in the same plane as the carbon atoms 
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which are joined to form a six-membered ring (Fig. 24.15). If we look at a particular 
carbon atom, we find again a trigonal arrangement with respect to the projecting 
hydrogen atom and the two neighbouring carbon atoms. We see that, in a quite similar 
fashion to ethene, one orbital per carbon atom, containing one electron, is left over. 
All such Pz orbitals on the six different carbon atoms are equivalent energetically. An 
electron can thus in principle be in any one of the orbitals. 




Fig. 24.15. Electron density distribution in benzene, C^H^. Left: A series of trigonal arrangements in the six- 
membered ring of benzene (a bonds). Right: The density distribution of the n states, which are formed from 
the orbitals of the six carbon atoms 



Let us now apply the fundamental idea of the LCAO method, i.e. the method of 
linear combinations of atomic orbitals. It tells us to seek the wavefunction for a single 
electron in the field of all the atomic nuclei, here in particular in the field of the six 
carbon nuclei. We then have to deal with a generalisation of the hydrogen molecule, 
where, however, the electron is not shared between two, but rather among six different 
positions. If we denote the p^ orbitals at the six carbon positions as tti, 712, 713, ... , 7^6, 
we thus arrive at a wavefunction of the form 



y/= Ci7ri + C2 7T2+ . . . + C67T6 . (24.75) 

To determine the coefficients Cy, we can utilise the symmetry of the problem. That 
is, if we were to move each carbon atom in the ring one step ahead, the overall potential 
field which acts on the electron we are considering would remain unchanged. We have 
an example of a rotational symmetry, in which the potential field remains unchanged 
when we rotate the molecule by 60°, or a multiple thereof, about an axis perpendicular 
to the plane of the molecule. At this point, we can use the results of Sect. 16.1.2. There 
we found that the wavefunctions in an atom must have a certain form when the Hamil- 
tonian possesses rotational symmetry. It may be shown, as one can verify using the 
discussion in that section, that the wavefunction (24.75) must be multiplied by a factor 
of the form 

^2imn/6 (24.76) 

when we rotate the molecule by an angle of 60° . Here the quantum number m can take 
on the values m = 0, 1, . . . , 5. The relation 

WxoidXtd = ^/^unrotated (24.77) 

results in relations between the coefficients Cj in (24.75): on rotation of the molecule by 
60°, the n wavefunctions are permuted among themselves: 



7T2 — ► 7Ti , 7T3 -> 7T2> • • • j 7Ti ^ 7T6 . 




446 



24. Fundamentals of the Quantum Theory of Chemical Bonding 



We then obtain from (24.77) 



Cl 7T6"I" ^27^1 + C 3 7T2 + • • • 4" C67T5 — e^^^^^^(Ci 7Ti + C27T2 + • • • + ^ 5715 ) • (24.78) 

Since the wavefunctions Uj are linearly independent of one another, (24.78) can only be 
fulfilled if 

C 2 = e^^'^^/^ci , C 3 = e 2 ^"^^/^C 2 , etc. (24.79) 

holds. The solutions of these equations are 

Cy = e2^i'”^/6, (24.80) 

with which we have determined the coefficients in (24.75) up to a normalisation factor. 
If we examine the result of these considerations, we see that in benzene, there are not 
only electrons which are localised between C and H and between C and C atoms, but 
also there are electrons which can move about the entire molecule. Since the phase 
factors in (24.78) represent a wave, each of the n electrons which we refer to here can 
propagate around the benzene molecule like a wave. 

The methods of quantum mechanics which we have introduced here are not only 
applicable to still more complex molecules - they also form the basis for the quantum 
theory of electronic states in solids. In particular, the wavefunctions for electrons in the 
conduction band of a crystal may be regarded as direct generalisations of the benzene 
wavefunctions (24.75) and (24.80). 



Problems 

24.1 The problem of the hydrogen molecule ion can be simplified by following the 
motion of the electron in only one dimension and representing the attractive potential 
of the nuclei by - pd(x-Xa)- Pd(x-Xij). Calculate the wavefunctions for the (two) 
bonding states and discuss their symmetry properties. How does the binding energy 
depend on the distance? 

Hint: Proceed as in Problem 9.3. For the definition of the ^-function cf. Appendix A. 

24.2 Repeat the calculation of Problem 24.1 for the case when the attractive forces P 
of the two nuclei are different and discuss how this affects the probability distribution 
of the position of the electron. 

24.3 To arrive at a model for the n electrons of benzene, one can think of six d poten- 
tials of strength yff, periodically arranged. To imitate the ring, it is required in the one- 
dimensional calculation that the wavefunction \j/(x) repeat itself after six lengths a 
between the potentials: 

^(x+6a) = \i/{x) . 

Calculate \i/(x) and E for the states with E<0. 

Hint: Use Problem 24.1 and (24.77). 
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A. The Dirac Delta Function and the Normalisation 
of the Wavefunction of a Free Particle in Unbounded Space 



The English physicist Dirac introduced a function which is extremely useful for many 
purposes of theoretical physics and mathematics. Precisely speaking, it is a generalised 
function which is only defined under an integral. We shall first give its definition, and 
then discuss its uses. The delta function {S function) is defined by the following 
properties (x is a real variable, - oo g oo): 



1) 


<5(x) = 0 


for X + 0 , 


(A.l) 


2) 


b 

\d{x)dx = 1 


for a<0<b . 


(A.2) 



a 



The d function thus vanishes for all values of x =1= 0, and its integral over every interval 
which contains x = 0 has the value 1. The latter property means, speaking intuitively, 
that the d function must become infinitely large at x = 0. The unusual properties of the 
d function become more understandable when we consider it as the limiting case of 
functions which are more familiar. Such an example is given by the function 



1 

7 = ^ ’ 

l/TTW 

which is shown in Fig. A.l. 



(A.3) 




Fig. A.l. The function (l/]/^«) exp(-JvV«^) plotted against 
the variable x. If we let the parameter u become smaller, the 
value of the function at jc = 0 gets larger and larger and the 
decrease to both sides gets steeper, until the function has 
finally pulled itself together into a d function 
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If we allow u to go to zero, the function becomes narrower and higher until it is 
finally just a “vertical line”. We thus have 



for X 4= 0: lim — e = 0 . 

M-^o ]/nu 

On the other hand, one can find in any integral table the following fact: 



(A.4) 



+ OO 



— OO 







(A.5) 



independently of the value of w. If the limit w ^ 0 is calculated it becomes clear that 
because of (A.4), we can write the integral (A.5) with any finite limits a and b with 
a <0 <b without changing its value. This is just the relation (A. 2). 

In many practical applications in quantum mechanics, the d function occurs as the 
following limit: 



... 1 sinwx 

5(x) = hm 

n X 



(A.6) 




Fig. A. 2. The function s\n{ux)/nx 
plotted against x. If we allow u to 
go to the limit infinity, the value 
of the function at a: = 0 becomes 
larger and larger. At the same 
time, the position of the zero- 
crossing moves towards a: = 0 



The property (A. 2) is found to be fulfilled when we take into account that 



1 sin(wx) 

-OO 7T X 



(A.7) 



The property (A.l) is not so obvious. To demonstrate it, one has to consider that for 
w -► OO, X 4= 0, sin(wx)/x oscillates extremely rapidly back and forth, so that when we 
average the function over even a small region, the value of the function averages out to 
zero (Fig. A.2). 

The d function has, in particular, the following properties: 
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for a continuous function /(x), 
b 

\f{x)d{x-Xo)dx=f{xo), a<Xo<b is valid. (A. 8) 

a 

For a function f{x) which is n times continuously differentiable, 

1/ (x) 8^"\x-Xq)(1x = ( - l)"/("^(xo) , a<XQ<b (A.9) 

a 

holds. Here and mean the «th derivatives w.r.t. x. The proof of (A. 8) follows 
immediately from (A.l, 2). The proof of (A.9) is obtained by «-fold partial integration. 
Furthermore, 

S{cx) = — S{x) , c real (A. 10) 

|c| 

is valid. The relation (A.l) is seen to be fulfilled on both sides. If we insert (A. 10) in 
(A. 2), we find 

b 

\d{cx)dx , 

a 



which, after changing variables using cx = x\ becomes 
\ —d{x')dx\ a' ^b' for c^O, 

a' C 

which is thus, according to (A. 2), equal to 1/ |c |. 

We now turn to the question of the normalisation of wavefunctions in unbounded 
space, where we can limit ourselves to the one-dimensional case without missing any 
essentials. We start with wavefunctions which are normalised in the interval L, 

i//t(x) = (l/l/T)e‘^^, (A.ll) 

for which we have the normalisation integral 

L/l 

I |v/(x)prfx=l. (A. 12) 

-L/2 

If we furthermore assume that if/{x) is periodic, tj/(x+L) = y/(x), the Ar’s must have 
the form 

= « = 0, ±1, ±2,... . (A.13) 

L 



It is easy to convince oneself that 
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L/2 



1 wi{x)Wk'ix)dx = 

-L/2 



^k,k' 




for 

for 



k = k' 
k^k' 



(A.14) 
(A. 15) 



To prove this relation, the integral must be computed, taking account of (A. 13). 
The integral yields 



1 

L 



L/2 



I e 

-L/2 



-\kx-\-\k' X 



1 

\L{k'-k) 



(exp[i(A:' -A:)T/2] -exp[-i(A:'-A:)Z,/2]) . 

(A.16) 



If we now abbreviate k' -k with ^ and L/2 with u, we may write (A.16) in the form 
sin(<^M)/^M . (A.17) 



This is, however, apart from the factor 2 tt/L, just the function which appears in (A.6) 
on the right under the limit, if we identify with x. If we thus divide (A.16) by 2;r/L 
and form lim , we obtain on the left side of (A.16) 

L^oo 

^ L/2 

lim I Qxp(-ikx-^ik'x)dx , (A. 18) 

2n L-^OO -L/2 



which we may also write somewhat differently: 



+ 00 



1 -ikx 


* 


1 ik'x 


l]/2^ i 




l]/2^ J 



dx . 



(A.19) 



The right-hand side of (A.16), using (A.17) and (A.6), goes to d(k' -k). We thus 
finally obtain 



J V^^(x) ii/k’{x)dx = d{k'-k ) , 

— oo 

where 




(A.20) 



(A.21) 



Equation (A.20) with (A.21) generalises the relation (A.14) [with (A. 11)] to the case of 
wavefunctions without finite boundary conditions and thus to the corresponding case 
of continuous k values. As may be seen in all practical applications, the d function in 
(A.20) always occurs under further integrals over k or k' (or both), so that we have 
found a self-consistent formalism. 

Let the wavefunctions in (A.21) depend not upon k, but upon p = hk; then we must 
observe (A. 10). In order to normalise the new wavefunctions 

y/p(x) = 
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correctly, we must set iV equal to (\/\/h) {\/\/^) = {\/]/h). The normalised wavefunc- 
tion is now given by 

Vh 



B. Some Properties of the Hamiltonian Operator, Its Eigenfunctions 
and Its Eigenvalues 

We write the time-independent Schrodinger equation in the form 

J^\l/n = EnWn (B.l) 

with the Hamiltonian 

^ = V V V(r ) , V{r) real . 

2mo 

The y/n(r) are square-integrable eigenfunctions with the eigenvalues Here, = 0 is 
excluded. The eigenvalues E^ may be discrete or they may be continuous. 

In the following, we denote by \i/^ and the wavefunctions on which the operator 
can act. We can now easily read off the following properties: 

a) ^ is a linear operator, i.e. the relation 

holds, where and are some complex numbers. In particular, it follows from this 
that every linear combination of eigenfunctions of with the same eigenvalue E is 
itself an eigenfunction of with the eigenvalue E. 

b) ^ is Hermitian, i.e. the equation 

(B.2) 

is valid. It follows from (B.2) that for the operator of the potential energy, 
F*(r) = F(r). For the kinetic energy operator, (B.2) can be proved by double partial 
integration, taking into account the fact that the wavefunctions vanish at infinity. 

c) The eigenvalues E^ are real. This is a consequence of (B.2), if one inserts for and 
l//vthe same eigenfunction y/n and utilises (B.l). 

d) Eigenfunctions with different eigenvalues are orthogonal. 

We take the following scalar products (different eigenvalues belong to the functions y/^ 
and 
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W^ir)[Jti//n(r)]dV = y/„(r)dV, 


(B.3) 


\[jeWn,ir)]*>^nir)dV = E*\wUr)Wn(r)dV. 


(B.4) 



We subtract (B.4) from (B.3) and use (B.2) and the property that the eigenvalues 
are real: 

0 = {E„-EJ I v/*(r) y/„{r)dV. 

V. V- ' ' ^ 

4:0 0 

The second bracket indicates the orthogonality of the wavefunctions. 

Furthermore, it can be shown that eigenfunctions with the same eigenvalue may 
always be chosen to be orthogonal by using appropriate linear combinations. 



C. Derivation of Heisenberg’s Uncertainty Relation 

For our derivation, it is important to define the uncertainties zlx and Ap which occur 
in (7.24) in a mathematically precise fashion. To this end, we introduce the variances 

{,Axf = {x-xf (C.l) 



and 

(^ = (P-J5)' (C.2) 

where the bar refers to the quantum-mechanical expectation value. We can always 
transform the coordinates so as to obtain 

x = p = 0 . (C.3) 

We can then assume 

(Ax)^ = ? (C.4) 

and 

(^ = P. (C.5) 

With the definitions 

(Ax)^ = I y/*x^y/dx (C.6) 

and 



{Apf= y/*^dx 



(C.7) 
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we can investigate the size of the expression 



For this purpose, we introduce an auxiliary integral: 



m = 1 



dx 



1 

dx>0 , 



(C.8) 



(C.9) 



where is a still-to-be-determined real number. Since the absolute value is greater than 
0, the overall integral as given in (C.9) is also greater than 0. We multiply out the in- 
dividual terms in the integrand and thus obtain: 



m = e\x^w\^dx+\ 



b\!/ 



dx 






(Axf 



2/fc2 



{Apy/h 



(C.IO) 



where the meaning of the first two integrals is indicated beneath them in each case. The 
integrand of the last integral in (C.IO) can be written as a perfect differential, so that 
we obtain for this integral 

f d 

]x — (y/*ii/)dx . (C.ll) 

dx 



Then we carry out an integration by parts, leading to: 

+00 

x\ y/y I dx . 

— oo 



(C.12) 



Since the wavefunction must vanish at infinity, the first summation term in (C.12) 
vanishes, while the second yields 1 because of the normalization condition. With these 
results, (C.IO) can be simplified to yield: 

= ; (C.13) 

this expression is positive for all real values of We now determine the value of ^ for 
which / is a minimum: 

*/(^min)^0 . (C.14) 

If we differentiate (C.13) with respect to (J, we obtain the condition for a minimum: 






(C.15) 




454 



Appendix 



from which we find immediately 

= — (C. 16 ) 

2(Axf 

As can be readily shown, (C.16) in fact gives the true minimum of (C.14). If we now 
insert (C.16) into (C.13) and take (C.14) into account, we obtain: 

+l5(J^>0 (C.17) 

and from this, the relation 

(AxfC^pfsi— . (C.18) 

4 

The right-hand side gives the smallest value which the left-hand side may assume. In 
writing the Heisenberg uncertainty relation, one usually proceeds in a somewhat looser 
manner, taking the square root formally on both sides of (C.18) and setting 

((Ax)Y^ = Ax (C.19) 

and 



((Ap)Y^ = Ap, (C.20) 

thus obtaining 

. . fi 

AxAp>- . 

2 




Solutions to the Problems 



3.2 



2.1 b) 7VA = 6.25-10^^mor^ 

2.2 /•He = 2.08-10“‘®m = 2.08A 



Pressure p [mbar] 


1 


10-2 


10-^ 


Number of collisions 


1706 


17 


0.17 



2.4 Helium: A*He = 1.33 A; mercury: A*Hg = 1.19 A 

2.5 The Bragg Law leads to a lattice-plane spacing of 
rf=2.78 A 

2.6 Debye-Scherrer arrangement; Bragg’s Law gives 
r = 16.1 cm in first order (n = 1) and r = 33.1 cm 
in second order (« = 2). 







3h + 


Mass m 1.67-10“ 

[kg] 


3.35- 10" 


5.02-10“^’ 


Radius r 9.14 

of the cir- 
cular orbit 
[cm] 


12.9 


15.8 


Deflection 14.6 

on the 

screen 

[cm] 


9.39 


7.46 


Parabola method: 






Y = AX^ 




(1) 




H + 


H 2 " 


Mass m [kg] 


1.673 


3.347 - lO-^’ 


A in Eq. (1) [1/m] 


65.27 


130.58 


V (1000 V) [m/s] 
A^(IOOOV) [m] 
r(lOOOV) [m] 


4.376 • lO’ 
1.751 • 10“^ 
2.000 - 10'^ 


3.094- 10^ 

1.238-10-3 

2.000-10-3 


V (4000 V) [m/s] 
(4000 V) [m] 
r (4000 V) [m] 


8.752 • lO’ 
8.753-10-^ 
5.000 - 10-3 


6.188- 103 

6.188- 10-3 
5.000-10-3 



If both positive and negative particles are injected, one obtains 
parabolas in the ( + +y) and {-x, -y) quadrants. 



2.7 Bragg reflection in first order: 0 = 3 .63° . The neu- 
tron wavelength is A = 2.023 -10“^^ m. 

2.8 = 0.35 nm 



3.1 The orbital radius of a charged particle: 

r = momentum filter for q, B constant. 

qB 



With p = ]/2mE (classical energy-momentum re- 
lationship), we have 



Abundance ratio P„ 


of 


to 


after the 


separation step: 








Pn = PoQ" where 




_ _ 


0.754/99.246 


Po 




0.72/99.28 


Amount of 333u 


50% 




99% 


Pn 


1 




99 


n in Eq. (1) 


106 




205 



4.1 When the aluminium foil is replaced by a gold foil 
of the same thickness, Z and in the Rutherford 
scattering formula (4.20) are changed, i.e. 




mass filter for q, B and E con- 



dn 

dt / Ai 



dn 

/ Au 



^AiA^AI ^Au^Au 



qB 

stant. 
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One finds ) =36.11 • f-^ ] , meaning 

\ dt \ dt /ai 

that 36110 particles/second are measured. 

4.2 From the Rutherford scattering formula (4.20) it 
follows that 




e\°] 




10 


45 


90 


135 


180 


( dn\ 
\dt ) 


1 

_ s _ 


10* 


2690.5 


230.8 


79.2 


57.7 



4.3 The kinetic energy of the proton is converted com- 
pletely to potential energy as a result of the Cou- 
lomb interaction. The distance of closest ap- 
proach a is given by 

e^Z 1.138 -lO'^eVm 

a = = . 

4 7r£o^o Eq 

Eo= IMeV: = 1.14 • lO'^^m 

^ 0 = lOMeV: a = 1.14 • lO'^'^m 

The radius of a gold nucleus (cf. Sect. 4.2.5): 

R = (1.3±0.1)yl^/^ • 10"^^ m = 7.6 • 10"^^ m 

i.e. the proton “touches” the nucleus for 
£'o=15MeV. 

4.4 It follows from (4.12) that g = 2arccot 

= 12.49°. ^ 



4.6 Sketch (see also Fig. 4.7): 




a) Integration of (4.8) from infinity (Point A) to 
Point B, the closest approach, with 

^ 180° -0 , vob 

0B = and — cos 0B 

2 a-b 

yields the distance 

_ A77foCOS0B ,2 _2Ze^ 

a*b — b \ K — . 

A:(1-cos0b) 4neo 

Equation (4.12) leads to 

_ k (. 1 \ _ 

''b - , _ ( 1 + — 1 - ''ai + '‘a • 

2£^o V sm0/2 / 

The radius of the Al nucleus is thus 
A-Ai = 2.7-10“‘^m. 

b) From energy conservation, 

Eq = —mv \+ — ; (1) 

2 /*B 

from momentum conservation, v^b = i?b (2) 

Equation (4.12) b = ^ cot 6/2 . (3) 

2Eq 



4.5 It follows from (4.12) that b = 2.16 • 10 m. 



Elimination of i^b and b from (1 - 3) yields 
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_ A: / 1 \ 5.2 

2£'o \ ^ sin0/2/ 

2 2 

c) hyperbolic orbits — _ = 1 . 

5.3 

Fro m the sketch, we see that r^ = a + c = 
a+\/a^+b^ ; with b = {2Ze^/4neomvl) cot 9/2 ^ ^ 

(Equation 4.12) and a = b tan 9/2 we find 

k (, 1 

/"b ( 1 + 

2£’oV sin 0/2 

4.7 a) The Rutherford scattering formula for protons 
(cf. 4.20): 



/^photon me h/X ■E'photon/^ 

^photon ~ 1 ~^Pphoton ~ 5.34 • 10 kg m S 
A = 12400A= 1.24- 10"®m 

Mass reduction: m = E/c^ = Pt/c^ 

= 3.504-10“* kg 



a)£’kin,e- =-£'kin,e+ = | E'y- w = 0.489 McV 
= 7.83 • 10"'^ J 



blE = mc^ = 



mpc^ 

l/l -rVe^ 




1 - 



E ) 



0.86 



( dn\_ Z^e^DN 
\ndQ)~ nEpfEl%m\9/2) 



( 1 ) 



Integrating over the angle 0 {dQ = %mOd6d0) 
gives 

( dn\_ Z^e^DNcos{9/2) 

\nd9)~ 6AnelEls\n\9/2) 

i.e. £ 0 = 5.62 • 10“‘* J = 3.51 MeV. 

b) Setting 

dQ dQ 

in (1), we obtain the differential cross section 

— = 1.052 • 10“^’' m^= 10.52 barn . 
dQ 

c) From (4.12), with k = kp = ZeV4 n Cq, we ob- 
tain 

(? = 2.81 •10“’''m . 



4.8 Eleetrons are not subject to the strong interaction, 
and would therefore exhibit no anomalous Ruther- 
ford scattering. Neutrons have no electric charge. 

5.1 .^photon hc/X mpk c > mpjj h/Xc 



5.5 IFy = Ac/A = 1.136- 10“'* J = 7.09 eV. 

This corresponds to the binding energy of the O 2 
molecule. 



5.6 Photon energy: 

= Ac/A = 3. 31- 10“ '^J = 2.07 eV . 



5.7 



5.8 



Photon flux for a radiant power of 
P= 1.8 - 10“'*W: 



dt Ey 



Frequency v [Hz] 


10 ^ 




2.42 - 10^’ 


Photon energy h v [eV] 


4.14 


• 10“'^ 


10 ’ 


Number of photons per 


1.51 


• 10^® 


6 . 2 - 10 " 


VC? and second 









Photon momentum Py = rtiyC = Ey/c. 

The radiation pressure for perpendicular inci- 
dence and complete absorption: 



APy _ AEy 

At AA cAtAA 

^^ = 4.61 - 10 “" 

P 

^ atm 



= 4.67-10‘^kg/ms^ 



The force on an area of 1 m^: 
F=Py^iA =A.bl - 10“®N . 
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Upon complete reflection, the momentum trans- 
fer is twice as large; Prad and F are doubled. 




-5- = 5800K . 
a 



5.9 a) Momentum conservation: 



Ey 



1/2 -^atom ^atom ^atom 
C 






Integration of the spectral energy density 
m(v, T) dv from (5.6) gives the total energy density 
in the interior of a black body: 



b) Mercury: 

£'atom= 1-02 = 6.4 -lO-^eV . 

From (7.29) the energy-time uncertainty relation: 
AE-At^h. 

With T = t = 10“* s, we have 
JF: » 6.6 • 10-2^ J>£atom- 

-► No effect on the position of the line, reabsorp- 
tion of the photon by other mercury atoms is pos- 
sible. 



t/= I n(v) dv = ^ T* = 0.854 W . 

0 15c /i 

The energy radiated from the sun is generated in 
the sun’s core and transported outwards. There- 
fore, /s gets larger and larger towards the interior, 
and T and U increase strongly. 

5.12 Using Wien’s displacement law (5.4) 

-^max T = 0.29 cm K , A„,ax = 9.67 pm . 



c) Nickel: 



Eaton, = 2.59 • 10'** J = 16.2 eV . 



^atom 



With x = At =\0 s, we have 
ZIE« 6.6- 10-20 J^Eatom- 

^ Noticeable line shift, reabsorption of the y 
quantum by other (stationary) Ni atoms is not 
possible. 

5.10 The Stefan-Boltzmann law gives the power radiat- 
ed by the sphere: 

P=nd^aT\ r=487K or 214°C . 

Annual loss of mass 



/lm = — = — =3.5-10-*kg . 



Monochromatic energy density (5.6) 

8nhv^ 1 

= 1.3-lO-*0js/m*«O.8eVs/m* . 

5.13 hv = mv^/2 + eVf^ = 46V , A = c/v = 310nm 

5.14 a) Braking voltage from (5.28): 

hv he 

Vra^ = -Fa = — -Fa=1.2V . 



Ac 



b) ES“ = cF,„ax= 1.2 ev = 1.92 -lO-'Oj , 



^ max _ 



2 Pkin 



= 6.5 • 10^ m/s 



m 



5.11 The intensity of radiation at the sun’s surface Is is 

P = 7e4 nr^ = Is4nR^ and therefore 
/s = 6.4*10^W/m^ . 



5.15 a) Assuming that each photon releases an elec- 
tron, the number of electrons per s and area Js is 



= 7 

At • A 



JL^ = 6.04*10^sm^)-^ . 
he 



From the Stefan-Boltzmann law (5.3), we have for 
the temperature 



b) The energy absorbed per unit of surface area 
and unit time is 
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AW _ AN-eVA 
At A At -A 



1.208 •10’‘'eV/m^s . 



c) £'kin= /iv-6Fa= 1.77 • 10“‘^ J = 1.1 eV . 



5.16 Table of values: 



V [10“* Hz] 



E,-=AEy = E ; ^ = -^. (2) 

l + (^(l-cos0) niQC^ 



From (1) and (2) it follows that 



A + VA^ + l 



« 0.184 ; A = 



2 mlc^ 



> Energy and wavelength of the incident photon 



E = moc^^ = 94 keV , A = hc/E = 0.132 A . 



5.19 Energy and momentum conservation: 



Equation (5.27): V=- 



E-E' =E, 



E E' 



■cos 6 = Pe cos 0 



Fitting using the least-squares method gives 



sin 0 = sin 0 . 



-- = (4.111 ±0.012)- 10“*' Js/C 
e 



e- : Pe.E, 



Fa = (1. 8939 ± 0.0076) V . 

a) F = 0: Viim = = (4.607 ± 0.032) • 10‘^ Hz 

h/e 

b) 1Fa = cFa = (3.034±0.012)-10-’^J 

c) /i/e = (4.111 ±0.012) • 10“'^ Js/C . 



5.17 Na D lines: An *= 589 nm 



>hv = — = eV with F=2.11V 
A 



y : E,v,X 




Y':E',v:X' 



Relativistic energy-momentum relation: 



pl = ^{El+2mo c^Ee) 



4.14-10“*^ Js/C 



a)(l)^±(2)^-(l)Vc^ and (4) yield 



5.18 Calculation of the kinetic energy of an electron 
from the orbital data: 



D i 2 

evB = mn *Ee- = — ninv = 

R 2 2 mo 



« 7.9 keV 



l±(^(l-COS 0) 



; <^ = - 



thus, the energy loss of the photon is 



AE., = E-E' =£’_^9_E2i^ = 96.8eV 



l±^(l-cos 0) 



From Problem 5.19 we have 



and the change in its frequency and wavelength is 
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jv = -^ = 2.34-10'®Hz , 
h 



/jA = — — (l-cos0) = 1.21 . 
moc 



— = 2.79- IQ-’m . 
4 ndgg 



b) (2)^ + (3)^ and (5) yield 

„2 ^ 2 .2 '(1 - cos d) [2 + ^ (2 + 0(1- cos 0)] 

(1 + 01 - cos 6)f 

This gives for the electron’s momentum 
Pq = 5.31 • lO'^'^kgm/s, and for its kinetic ener- 
gy, 

Ep = AE, = 96.8 qV . 



Equations (2), (3) and (5) give 



Motion after field reversal 



. F 4 . 

= Fvisc = 6 nrjrv = e — + — nr^gg 
d 3 



F^2,y.=Fi , 



^electr. ^grav. > 

giving 

1 / K 4 3 

v = e — + —nrQg 

6 ngr \ d 3 

= 1.67 -10'^ m/s . 



cot0 = (<J + l) 



1 - cos 0 



1 + cos 0 



leading to 0= 59.5°. 



5.20 (Compare Problem 5.19) 



a) zl A = (1 - cos 0) = 0.0243 A . 

rtiQC 

b) £„„-zl£, = E , 

1 + {E/mo c^) (1 - cos 9) 



>£'h„ = 293.7 eV . 



c) (1 A, 90°) = = 2.37«7o 

E 12409 eV 



d)ii^(0.1 A,90°) = 19.5% . 
E 



5.21 A = 355 nm, n = 1.8-10’'’ 

P= 3.3-10’ Watt 
I = 200 Watt 

6.1 a) For levitation equilibrium 

Ft = Feiectr. = 6^ = A-npQg = Fg^av. = FI , (1) 
d 3 



b) The correction introduces an additional term 
1/(1 + AX/r) on the left side of (2). Eliminating r 
from (1) and (2) yields 



ndgg 



-AX+ A^X^- 



6.2 Radius of the flight path r = 1.668 m 



Mass of the particle m 






= 1.86-10“^* kg 

Rest energy F = me’ = 104.5 MeV . 

The particle is most probably a muon. 

6.3 b) The electrons leave the apparatus without de- 
flection when the midpoints of both condensers 
are reached just at the moment of opposite zero- 
crossings of the high frequency field. 

Possible frequencies: 

n = 0, 1,2 . . . . 



C) VMn = = 1/ 2 ^ • 
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6.4 ■fi'ges “ -^0 -^kin 



moc 






■^kin ~ ^0 ^0 ^ 

2 



t) = i^c« 0.866 c 



^ go+^kin ^ j ^ ^ 2.96 

mo Eo WqC 

«.6 ^,A,_^kW^ = 1.8i8.l0-“m 

p m^c v/c 



6.7 Relativistic relationship A(£'kin): 

^ _ -^kin 

Eo ]/k^+ 2K ’ Eo 



After Fourier transformation, one obtains the 
spatial wavefunction of the free particle for t>0: 



y/(x, t) = 



moAk 

]/n{mo+ihtAk^) 

^ x^moAk^ 



• exp ^ - 



2{mo+ihtAk 




From it, the probability density for the particle 
can be calculated to be: 



I t) I ■ 



moAk 

y n{mo + Ak^) 

x^moAk^ 



exp 



(rrio + h^t^Ak*) 



Scattering angle a in the 1st order from Bragg’s 
Law: 

A = 2 c? sin— . 

2 



^kin 1 


10^ 


10^ 


10^ 


[eV] 








A[m] 1.227- 


10“’ 1.227 • 


10“ 3.877' 


10“" 3.702-10“'^ 


«n - 


33.15 


10.35 


0.99 



For scattering angle a = 30° : E^m =121.4 eV. 
6.8 ^kin (298 K) = I kT= 6.17 ■ 10“^' J = 38.5 meV 



The normalisation remains valid for all times, 
since every solution of a time-dependent Schro- 
dinger equation remains normalised. This can be 
demonstrated in general by calculating the time 
dependence of the normalisation integral and us- 
ing the fact that the wavefunctions obey the 
SchrOdinger equation. If the wavefunction is 
known as an explicit function of x and t, one 
could naturally calculate the normalisation inte- 
gral for any arbitrary time; this method, however, 
demands considerably more calculational effort. 
The probability density is a Gaussian function 
having a width at arbitrary time t of: 



k= _ = 1.458 -IQ-^^m 

l/2w£'kin 

Bragg reflection in 1st order at 0= 15.0°. 

6.9 k = — = ^ (nonrelativistic) 

P \/2 mF'kin 

A(a = Im) = 3.232- 10“^ m 

A(cr = 0.5 • 10“*‘’m) = 2.287 • 10“ ^°m 

7.1 The normalisation constant N is 
N=C2Ak)-'^^^n-^'\ 



Ax{t) = — i — ymo + h^t^Ak^^ ; 
nioAk 

thus, the wave packet spreads out with time. 

The factor = exp( - (k/A k)^) is the probabili- 
ty density in A:-space, i.e. it denotes the probability 
of finding the particle with a momentum = /i/: as 
a result of a momentum measurement. 



7.2 Ax = 



1 k^t^(Akf 

(Akf ml 



For / = 0 we have: Ak = cm 
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rriQ 

hAk 



{Axf- 



1 

(217 



1.5 • lO^^s 

^ * 1.3 • 10"^ s for an electron. 



7.3 In three dimensions, we use a trial wavefunction 
for the particle consisting of the product of the 
three one-dimensional wavefunctions from Prob- 
lem 7.1: 

t) = y/(x, t) y/{y, t) ^(z, t) . 

This is correct for arbitrary times, since the 
Hamiltonian is the sum of three one-dimensional 
operators, each of which commutes with the 
others: 



8.3 Practically all the atoms are in their ground states 
before the absorption; one obtains the lines of the 
Lyman series 

Ey = hcv = hcRi[(\-^^^ , 



A 



1 1 




az = 2,3,. 



8.4 a) Diffraction of mth order by a grating: 
A = (d/m) sin0 gives the wavelength of the Pal- 
mer line, A = 486.08 nm and the initial state level 
n = A. 

b) The Balmer lines from the n = 32 and a 2 = 31 
states must still be able to be separated, giving the 
necessary resolution A : 



2mQ\dx^ dy^ dz^ 



A=- 



v(n = 32) 



v(« = 32)-v(a2 = 31) 



= 3890 = m- p , 



The three-dimensional Schrodinger equation thus 
is decomposed into three mutually independent, 
one-dimensional Schrodinger equations, whose 
solutions were already found in Problem 7.1. 
Thus, the wave packet spreads out in each of the 
three spatial directions: 

4 

Ax{t) = ]/niQ + h^t^Akx > 

rrioAk^ 

Ay{t) = ^ y ml + h^t^Aky , 

moAky 

Az(t) = ]/ml + h^t^Ak* . 

ntoAk^ 



where m is the diffraction order and p the number 
or rulings of the grating; the latter must thus be at 
least 3890. 



8.5 From Sect. 8.4 we have 

p — ^ . p — — ^ 

■^kin ^ > -^pot ~ 

2r„ r„ 

therefore |£pot| . 



Ze^ . 
4 nso 



Total energy E„ = £'pot + £kin = - («/2 r„). 

This amount of energy is released, e.g. by the 
emission of photons, when a Bohr atom is 
formed. 



8.1 £’kin= 1.387 • 10”“ J = 8.657 • 10”* eV 
V = 4.07 ms”‘ 



^pot _ 

Ekm a/2 r„ 



- 2 independently of n. 



A [A] 


V [cm ^ ] 


v/Rh 


n 


3669.42 


27252.3 


0.24848 


>20 


3770.06 


26524.8 


0.24184 


>10 


3835.40 


26072.9 


0.23772 


9 


3970.07 


25188.5 


0.22966 


7 


4340.47 


23039.0 


0.21006 


5 



8.6 a)F^=m — = G 



Mm 



1 



b)£ki„ = -«Jf 

</)Epot= - 



1 ^ Mm 



2 2 r 

GMm 



( 1 ) 




Solutions to the Problems 



463 



-^tot “ ^pot ■*" -^kin “ ^ ■ 



e) Quantisation of orbital angular momentum 



L = mvr= nh 



f) (1) yields 



8.9 a) E„in) 






b) r„ = ^ 2.56 . 10"^ A • — 

Ze Z 

c) /! V = £’2(a/) - £■! c«) = 21 10 eV • 



t)V = GM 



8.10 With co„ from (8.9) and zl t = 10 * s, we have 



(2) and (3) lead to 



N= — -(o = 6.5%i ■\0’ -X 



GMm 1 - . ,^_34 ^ 1 

v„ = • — = 9.654 • 10 m/s • — 

h n n 



GMm^ 



1.199 -lO^’m-rt^ . 



8.7 For a ring current: 



/--i£i=-1.055.10-SA. ‘ 



a)rt = 2givesN=8.228-10^ ; 
,, 4.5-10’ ... 



b)« = 15gives7V= 1.950-10'* ; 
/:= 230800 . 



/ 1 1 

8.11 Pickering series: Vp = 4 Rhc — t “ — T 

\ 4^ 



and the magnetic moment 



Rydberg constant: Rhc = 



I I 1 2 ^ 1,1 

\^\ = eojr =- 1/| =- n = ^^n 

2 2 mo 2 mo 



(compare Sect. 12.2). 



1.055 • 10 ’ 
1 . 318 - 10 “ 
3.905 • 10 “ 



\fi\ [Am^] 

9 . 274 - 10 “ 
1.855 - 10 “^^ 
2 . 782 - 10 “^^ 



-^ = = 3.288 - 10*^ S-* - 
In 6An^ e:lh^ n 



Snsoh^ 



Ai^= 1.059* 10“^^m-«^ 



1st line:/! = 5 ; zlv = 0.443 cm ^ ; 

Zl£’=5.5*10“^eV 

3rd line:/! = 7 ; zl v = 0.828 cm ; 

Zl£’=1.03*10~^eV . 



8.12 Hydrogen-like atoms: 






Rh= 109677.581 cm “* ; 
Rhc= 109722.398 cm“* . 



r^-T • — r = “ 6.80 eV - ^ 

64 7T^£o^^ 



where r is the distance between the e and e'^ . 
(For the calculation, see Sect. 8.4.) 



We find 400 nm < A < 700 nm for 

Hydrogen with « = 3, 4, 5, 6 i.e.«'=2 

Helium^ with « = 4 n' =3 

n = 6..A3 n'=4 

« = 12...oo n' =5 . 
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8.13 The relative deviation y between the Sommerfeld 
energy formula (8.29) and the Bohr energy 
formula in the hydrogen atom for « = 2: 



E2,,-E2 _a^f2 3 

E2 4 U 4 



k 


1 


2 




y 


1.67- 10“’ 


3.33-10“ 


6 


Wavelength A 2 of the second laser 
A,- - 




Ei(H)(-^-iyil.5cV 


«2 


£„2teV] 




A 2 [nm] 


20 


1 

Ui 

0 

0 

1 


212 


603.5 


30 


-1.51 10~2 


All 


598.0 


40 


-0.85-10“^ 


848 


596.1 


50 


-0.54-10-^ 


1325 


595.3 


Required linewidth AE - 


0 


= 2-10“^eV. 




_ co„ _ e‘*mo 


_ 6.58 


•lO‘^s-1 



In 32 So 



b)v„^„_i = 






32n^slh^n^ 



1 - 



1 

2 n 



, 2 1 
1- — + -^ 



n n 



c) limv„^„_i = v„ 

«-► cx 

(compare Sect. 8.11: correspondence principle) 

8.16 The motion of the nucleus can be taken into ac- 
count by replacing the electron’s mass mo with the 
reduced mass 

jj, = moM/{mQ +M) (M = nuclear mass) . 



mo _ 


1 




mp 


1836.15 




Atom 




AE{Z,n) 
E{Z, n) 


'H 




5.45 • 10““ 






2.75 • 10““ 


’H 




0 

00 


‘'He+ 




o^ 

0 

1 






p 

00 

0 

1 

.p>. 



8.17 n = \-^n = 2; « = l->« = 3 

8.18 « = 3, 4, 5, 6; plot as a function of n 

8.19 (Bohr)* ///me£r. Spectrum: similar to the 
Balmer series. 



9.1 The Schrodinger equation of the force-free parti- 
cle is: 



ih — t) = 
8/ 



2/Wo dx^ 



y/(x,t) . 



Both sides of the equation can be calculated by ex- 
changing the differentiations with the integrations 
in A:-space. Substituting the given dispersion rela- 
tion for the free particle then demonstrates the 
equality of the left and the right sides. 



9.2 a) By substituting {//(r, t) into (9.32) . 
b) We require j 1 . 

Since f 0/ dV = djk , 

I Cl I C*2 P = 1 must hold , 

or, in general, ^\Cj\^=l . 

j 



9.3 From the requirement that the wavefunction be 
normalisable, we find from the time-independent 
Schrodinger equation: 



En{A,Z) = 



Em 



1 



mo 



i+— 

A m 



y/{x) = 



Nexp(kx) , 
Nexp(-kx) , 



with k = 



2 m^E 



x<0 

x>0 



(A - atomic mass number) . 
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The normalisation factor of the exponential func- 
tion must be the same on both sides, in order for 
the wavefunction to remain continuous at x = 0. 
The jump condition is 



V/'(e)-i^'(-e)= <y(0) . 

h 



From this, one finally obtains k = rtiQ p/h^, and 
the energy eigenvalue E= -{nioP^/lh^). The 
normalisation factor isN= ]/k. 

9.4 y/i , x<-L 

=^2COs(A:x)-i-52sin(Arx) , -L<x<L 

V'lii = > x>L 



K = 



— IrtioE _ j 2 mQ {E+ Vq) 
' / h'^ 



b\^ = 



1 






I a I ^ and 1 6 1 ^ are independent of the sign of pi 



9.6 The wall is located at the position x = 0. 

Since y/(x) must be continuous at x = 0, we find 

= 0 . 

For the wavefunctions, the result is: 



y/(x) = 



A sin(Ar, x) 
0 



x<0 

x>0 



with k = 



2 m^E 

~1F~ 



where E is the energy of the particle. 



The requirements that the solutions be continuous 
and differentiable at x = ±L yield: 

K = k tan {kL) -^E(L, Vq) (graphically or 

numerically) , 

and the two types of solutions: 



9.5 a = 



b = 



= Bs , 


fi2 = 0 


symmetric solution; 


= -B, , 

1 


B2*0 

-1 


antisymmetric 
solution . 


. moP 
ib^k 






1 


, with 


k = 


/ Im^E 


moP 

\h^k 




/ 



The meanings of a and b\ 

I a I ^ = reflection coefficient, 

I b P = transmission coefficient. 



kl' = - 



1 



h^k^ 

mlp^ 



-f-1 



9.7 {x) = \y/*{x, t)xy/{x, t)dx = 0 



{p) = \y/*{x,t)\^\h—j y/{x, t)dx = 0 

<£'kin> = -^-^= ^ y/(x,t)dx 

IrtiQ \ dx ) 

h^(Akf 



4 Wo 

<x^> = \ y/*{x, t)x^y/{x, t)dx 
1 h(Akf , 



2(,Aky 






2ml 



<x^> illustrates directly the spreading out of the 
wave packet. 



9.8 E = 



1 .. h^k^ 



1 dk- 



\/nAk 2/Wo 



txp{-k^/{Ak)^ 



9.9 [4,f^]= 6 

I V dz 9y / \ 9x dz , 



a 9 0 a 

- 1 z X — ]{y — -Z — 

ax az / \ dz ay 



a 



a 



hUy—-x—} = ihl, 
L 9x ay j 
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[ty, 4] and [4, 4] analogously. 



partial integration: 



= ty[ly,U + [Jy, 1,] Ty + f,] + [^ , Q I 

= -ihlylz-ihlzTy^ihlJy+ihlylz 



— ^ \dx( ]y/'^ 

2mo \ dx \dx ^ 

partial integration: 



[P, ly] and [/^, 4] analogously. 



9.10 X] = 0 and [A,, F(r)] = 0 

9.11 a) {aby = a* b* with x and dx real. 



°° Pi r\ 

b) I = 

-00 1 8x 



— 1^2 
1 



1 1 

\dx (//*/?(// = {p > 

A7?o rriQ 

— {p)= -\h\\u'^ ^—\u dx-\Pi\xu'^ dx 
dt ' dx ' dx 



Substitute in the Schrodinger equation and inte- 
grate by parts two times: 



I y/*dX 

-00 1 8x 



-^ip) = \dxy/*(--^v\i//= - 
dt \ dx / \dx / 



/ ? ♦ /j 6 . ' 

= 1 ¥i — —Vidx 

\-oo 1 6x ) 



C) j y/*-^xi,^dx = 

dx^ 



* 9 

¥*—¥2 

dx 






-^{¥*)¥2 

dx 



+ I ¥2-^¥*dx 

-00 dx^ 



In three dimensions, we have for each component 
/: 

rriQ—{Xi) = </?,) , — <A> = - <- — >. 

dt dt \dXi / 

9.13 The potential is V(x) = mn(o^(x-Xn)^-eo , 

2 

•*u ^0 A 1 ^0 

with Xq = — and Cq = ■ 

k 2 k 

The time-independent SchrOdinger equation is 



then 



^ 1 2, x2 

2 " —moCo\x-XQ)- So 



2 mo dx 2 



¥{x) 



= ( J ¥*-^¥\dx 

\-oo dx^ 

d 

9.12 — <x> = \ \i/*x\f/ dx+\\i/*x\i/ dx 
dt 



with y/ = ( - 



d^ 



\h \ IrriQ dx 
Substituting: 



+ F (// 



= E\i/{x) . 

Employing the transformations y = x-x^ and 
£ = £■+ £o> we obtain the well-known form 



d^ 1 



2 mo dy 2 



j + —moOJ^y^ 



y/{y) = E\i/{y) . 



The solutions of this equation are transformed 
back, and we find for the eigenfunctions: 



i/i r . / d^ / d^ 






1/4 



¥nix)={^^^^] exp 






2h 
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mQO) 

~ir 






with eigenvalues: 

\ 2/ 2 k 



1/8 



1 



8 



9.14 b = ~{ — + ^\ ; 6+= — (- — +^1; 

V^\8^ 7 i/^V a<^ 7 






moO) 






-(-— +7(— 

V 9<? /\9^ ) 



1 8^ ^8 8,8^ 

2 i 8^2 8^ 8.^ 8^^ 



-^A+A^ + ,^2-^2 
9^ 8r 



jr9,9jr i;^ 44 

= +(J 4-1 = 1. 



9.15 i//{t) = Qxpy-i^tj[i//Q+ if/xQxp{-iojt)] 

Since are real, we obtain 

\wiO\^= ¥o + ¥i + ^Wo¥iCOs{(ot) . 

If one chooses e.g. tn = 2 nn/a>, with 
« = 0, 1 , 2, . . . , the result is 

\witn)\^ = i¥o + ¥\f ^ and 

\¥{t„ + n/oj)\^ = i¥o-¥\f • 

The wavefunction thus oscillates periodically be- 
tween the two forms. 



9.16 a) J ^-^jt.(Ocf^=[M(Oi>(0]-oo 

For functions which vanish at infinity, then. 



h)\{b^ 0„r {b^ (t>„)d^=\0tbb^ <t>„d^ 

= \0*(\+b^b)d>„d^ 

= (l+n)|0*0„c?(^ 

c)\<t>U,d>„^,dlt = -^\{b^d>„r (b^0„)d^ 

n + \ 

with(b), =\<l>t¥nd^ 



d) 0„ = 






-ib'^)"0o follows immediately 



from (c). 



e) With + i = 

b'^ 0n = ^n + \ \ 



-b^ (!)n we have 



from = b^ 0n-\ we have 

17/ 

b(p„ = -^bb^ (p„_i = -^{\ + b* b)¥„. 
]/n ]/n 



\ + n-\ 



and therefore: 

b0„=l/«0„_l . 



<t>n-\ ; 



f) Assuming 

K„=[b, (&+)"] = «(* + )"-' 

is valid for a particular n, then it follows that 
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^«+i = [b, = b(b^fb^-{b^f^^b 

= {{b^)"b+K„)b^ 
-( 6+)”+‘6 

= {b^fbb^+n{b^y-‘^b^ 
-(b^Y^'b 

= ( 6 + )"(*+& + !) + «(&+)" 

= {n + \){b^Y . 

Thus, the assumption is also valid for n + 1 . Since 
it is valid for n = 1 , it is valid for all n ! 
Analogously: 

Qn=[b\b'']= -nb"-^ 

is supposed to be valid for a particular n. Then we 
also have 

Q„+, = 6 '^ 

= {b”b^ + Q„)b-b''*‘^b* 

= b"b^b-nb"-b'’*^b^ 

= b"{bb-^ -l)-nb''-b'’-^'b-^ 

= -in + l)b" . 

Thus, the assumption is also valid for n + 1. Since 
it is valid for /? = 1 , it is also valid for all n ! 



Analogously for the potential energy: 
:^x^~^{b + b^Y = iib^b^ + bb + 2b*b+l) 




9.18 To be shown: 

Wt<Pmd^= • 

0„ and <p„ are solutions of the SchrOdinger equa- 
tions 

b* b<t>n = n(l)„ and b'*' b0m = m0m • 
Multiplying by or , we obtain 

\0*b^b(p„d^ = n\<p*0„di and (1) 

\{b^b(pj*^„d^= m^0*,0„di^ 

= \0 *b^b 4>„d^ 

= m\0*,0„d^ . ( 2 ) 

Taking the difference between (1) and (2), we find 

0 = (n-m)\4>*,(/>„d^ , 

i.e. the eigenfunctions are orthogonal and - with 
Problem 9.16c - are normalised. 



9.17 Momentum p --(b-b^ )/i, position x — (6 + Z? ) 
with 

\(p*b(t>„d^=Yn\(l)* 0„_id^ = O and 
it follows that 

\4>*p0„dJ^=l^*x^„d^ = O. 

Kinetic energy: 



9.19 a) as in Problem 9.16e 

b) as in Problem 9.18 

c) , d) as in Problem 9.17. 

rripe* 
(4neofh^ 

1 

4 (47r£o)^A^ 



10.1 a)£,i„ = l 

2 (4nEorh^ 



b) £'kin - 



8 ( 47 r£o) 



\p^~-\{b-b^f 

= -\(bb + b^ b^ -bb* -b^ b) 

= -\{b*b* + bb-2b^b-\) . 

isthen \<Pt(^b^ b+ ^(p„d^ 

= i(« + |) • 



10.2 a) D = (0, 0, 0) 



b) 1. D = (0, 0, 0) 

2. D = d/]/^{0, 0,1) 

3. D = |l/^rf(-l, ±i,0) 



d 



256 

81 



1 . 

—r 

1/6 triQe 
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10.3 



10.4 



10.5 



11.1 



11.2 



Ki = [0.529- 10“* cm] £i=-13.55eV 

K2= [1.058- 10”* cm]-' £'2=-3.39eV 

K 3 = [1.587 -10”* cm]”' £’3=-1.51eV 

a) N={2/nf\ro)-^^^ 

'•omin = 67 t*^^£o ^V(]/^e^mo) 



Emm = -e*mQ/{\2h^n^el) = —E(, 

3n 

b) N={nrl)-^^^ 

''omin = 47reo^V(moP^) 

-Emin= -e‘^mQ/02h^n'^El) = EQ 
Eq: exact ground-state energy. 

Differential equation for g{x): 

g{x) + 2(—-]/i\-^ g{x) 
dx^ \x / dx 

+ l^(ci-2tT|A) = 0 
X 

Power series expansion for g(x) as trial solution: 
g(x)= I a„x" 

n = 0 

Recursion formula for the expansion coefficients 
a„: 

a„+i = a„{2n]/E-Ci + 2a\/e)/[(n + l)in + 2(T)] 



Energy eigenvalues: 








E„ = 


mocj 


n + 


1 1 

— + — 


\/i + SmoC2/h^ 


-2 




2h^ 


- 


2 2 


- 






En.i [eV] 


/=0 


l=\ 


1 = 2 


Li: 


n = 2 




-5.31 


-3.54 


— 




n = 3 




-2.01 


-1.55 


-1.51 




n = 4 




-1.05 


-0.87 


-0.85 


Na: 


n = 3 




-5.12 


-3.03 


-1.52 




n = 4 




-1.97 


-1.40 


-0.85 




n = 5 




-1.03 


-0.80 


-0.55 



En.l- - 



^alkali ^ ^ 



{n-A{l)Y 
Zl(0) = 0.41; zl(l) = 0.04 



Transition 


V [cm ^ ] 


A[nm] 


Region 


3/7 ->-25 


31000 


323 


UV 


3/7 ->35 


3800 


2619 


far IR 


35->2p 


12200 


820 


IR 


2/7 ->25 


14900 


671 


visible 



11.3 







3^D 


j2p 


Principal quantum 
number 


n 


3 


3 


Spin quantum number 


5 


1/2 


1/2 


Angular momentum 
quantum number 


/ 


2 


1 



Three lines result from this transition: 



3'Z?5/2- 



YD 

3^D 



3/2 ■ 



3 / 2 ' 



•3'P3/2 

^3'P3/2 

.3^Pi/2 



The new index represents the total angular mo- 
mentum quantum number J = L±l/2. 



12.1 Electron: 

a)i^ = 2nvL = 
Proton: 

OJl = 271Vl = 



2]LIbB 



= 3.52-10'^s 



, 6.-1 



5.585 ju^ B 
h 



= 5345 s”' 



with jjL^ = //b/ 1836. 
12.2 Result: 



mdv^ 



285,/6z/i(/2+/i/2) 



9.32- 10”^ Am^ 



The nuclear spin contributes to the deflection by 
an amount which is 3 orders of magnitude smaller 
and can thus be neglected. 

With 1= nh (cf. Bohr model) we have from Sect. 

12.8 

B,= /= 12.54 T. 

4nr^mo 



12.3 
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12.4 



12.5 



In general (compare Sect. 12.2): 

J^i = 9 1/^B = • 

Electron: jji^ = = 9.274 • Am^ 

2 mo 

Muon: ju^imuon) = n^/201 = 4.480 • 10“^^ Am^ . 

In the case of positronium, due to the opposite 
charges and equal masses, no current flows and 
the magnetic moment is zero. 

«'/(/+!) (/+1) 

• C/(/+i)-/(/+i)-5(5+i)) 



with Fq = 



\6nmlal 



= 5.81 •10'^^J = 3.63-10"'‘eV 



n 


/ 


j 


F) 


leV] 


2 


1 


1/2 


-1/12 


-3.03 • 10“’ 






3/2 


1/24 


1.51 • 10‘^ 


3 


1 


1/2 


-2/81 


-8.96- 10"* 






3/2 


1/81 


4.48 • 10“* 




2 


3/2 


-1/135 


-2.69-10“^ 






5/2 


2/405 


1.79- 10"* 


30 


1 


1/2 


- 1/40500 


-8.96 - 10-® 






3/2 


1/81000 


4.48 • 10'® 




29 


28.5 


1 

O 

1 

00 


-1.57 • 10'*' 






29.5 


VO 

O 

1 

00 


1.52- lO"” 



12.6 The energy difference AE between the states: 

1 _ 1 
Ag Ag + 4lA 

= 1.101 • 10 “^‘’1 = 0.0687 eV 

with Ag = 852.1 nm, zl A = 422 nm. 

The fine-structure interaction energy (cf. Sect. 

12 . 8 ): 

F/, s = - Us- Bi= g^iiiB l/^C^-Hl) B, cos ( ^ {s, B,)) 




^AE= F,,s(/- = 3/2)-F,,s(/-=l/2) 

-^5/= 559.5 T, a = 7.339 • 10'^‘ J = 0.0458 eV . 



12.7 1) Energy states of the hydrogen atom as describ- 

ed by the Dirac theory (cf. Sect. 12.10): 

„2 



En,j = En + ^E„j = E„ + 



with E„ = -hcRii/n^ 



EnCt^ 



1 



iy-l-1/2 4n 



2) The Lamb shift lifts the degeneracy of the states 
with the same value of so that states with smal- 
ler / values have slightly higher energies. 



3) Selection rules for the possible transitions: 



Al = ± 1 ; Aj = 0, ± 1 . 

A total of 18 lines occur; see Fig. 12.20 (taking the 
lifting of the J degeneracy into account). 



12.8 a) - 



Efs n 

HcRZW 



1 



y+1/2 



An 



7max+l/2 An 



An 



>0 



with ymax = n-\/2. Then, for all n and y, we have 
E'fs < 0. 



b) In hydrogen-like atoms, the energies of the elec- 
tronic states depend only on n and j. For a given 
n, there are just n different j values: 1/2, 3/2, . . . , 
A? - 1/2. Thus, the terms for a? = 3 and aa = 4 are 
split into 3 or 4 levels, respectively. 



C) 4l V 


_ ^lEps _ _ 


a^RZ^f 1 


3 ' 
An ) 


1 


he 




Vy + l/2 


n 


3 3 


3 


4 4 


4 


4 


j 


1/2 3/2 


5/2 


1/2 3/2 


5/2 


7/2 


-A'v 


0.162 0.054 


0.018 


0.074 0.029 


0.013 


0.006 


[cm”'] 












(for Z 


= 1). 











= -^C/X/ + l)-/(/+l)-^(5+l)) 

2 

(For the calculation of cos(')c(5, B/)) see Sect. 

12 . 8 .) 



d) Selection rules: Al= ±\\ Aj = 0, ±1. 

The following table shows the possible transitions 
and the deviations in the positions of the 
lines as a result of the fine-structure interaction. 
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12.9 Fine structure (spin-orbit) interaction: 

= — [A/+l)-^(i + l)-S(S+l)] 

2 

The fine-structure constant ^ is a constant for a 
particular multiplet (L, S = constant). 

Selection rules: AL = (0), ±1 AJ=0, ±\ 

(for the case of ZIL = 0 see Sect. 17.3.2) 

(see figure on the right) 

12.10 a) A magnetic dipole always produces a magnetic 
field B (r) in its neighbourhood: 

4n \ ' / 

_ Ho ( 'i{H-r)r-nr^\ 

4;rl /-^ /■ 



Thus, for the interaction between two dipoles, 
E= -/i2-B](r) 

_ do H\ H2 3 iH\-r){H2-r) 

An P P 

E is always zero when H 2 -L 

b) If fi\ and /i 2 are parallel and 6 is the angle be- 
tween H\ or H 2 and r, we have 



E = -^ . 

An P 

Extreme values for fixed \r\ = r: 



0 = 0, n: 
e=n/2: 




c) Electron-electron: 

Hs = s=l/2-, Hb = - ; 0 s = 2 

2 mo 



a) Transition ¥ — — 




b) Transition 




Diagram for Solution 12.9 
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Angle 6 


Energy E 


Magnetic field \Bx (r) | 


0, n 


-6.451 • 10“^“ J 


0.402 T 


n/2 


3.226 - 10"^ J 


0.201 T 


Proton-proton: 


MN 9 

1836.15 


/ip = !//(/+!); /=i/2; 

0P = 5.585 


Angle 6 


Energy E 


Magnetic field \Bx (r) | 


0, n 


-1.492- 10“^^ J 


6.11 - 10“'‘T 


n/2 


7.46- 10“ J 


b 

o 

H 



13.1 V = = 2.8 • 10^ Hz = 2.8 GHz 

h 

13.2 *Du2- L = 2, S = 3/2, 7=1/2 

B Vector model, cf. Fig. 13.9 




The resultant vector fij for the magnetic moment 
is perpendicular to / and precesses around it. Only 
the component (jij)j which is parallel to / is mea- 
surable; however, it does not occur here (cf. Fig. 
13.9). 

Therefore, the term does not split in a 

magnetic field. 

13.3 0 = ^ (L,J) = 50,71^ 

The condition Vy 2 (i, l)-^- 3/2 (|, i) = AE 
gives 



7 //b \^1 ^2/ 

b) Di line: zl v = — = 3.73 • 10‘‘’Hz 

3 h 



10 



D 2 line: Zl v = — = 4.67 • 10‘® Hz. 

3 h 

13.4 a) The energy difference between the ^PV2 and 
^P \/2 States without a magnetic field is 



AE = 



he he 



Ai A 2 

Additional energy in a magnetic field: 



'Pv2:/=l,y = l/2 02 = 2/3 


(1) 


^mj (t > ~ T /^B B() iily 


%n-l=iJ = y2 gj = An 


(2) 





(cf. Sect. 13.3.4 and 5). 

Condition: V^Ci, 1) - = gives 

5o = -— (^—-—^ = 15.77 T . 

7 //b \^i ^ 2 / 

b) Di line: Zl v = = 3.73 • 10‘® Hz 

3 h 

D 2 line: zl v = = 4.67- 10'° Hz . 

3 h 

13.5 The extra term in the energy for the anomalous 
Zeeman effect: 

AE^ = gjH^Bnij; 

j(j + \) + s{s+\)-l{l+\) 



9j= 1+- 



Land6 factors: 



2JU+D 



gj^Dsn) = 6/5; = 4/5; ^ 3 / 2 ) = 4/3; 

gjePm) = 2n. 

Selection rules: Al= ±l,Zly = 0, ±1, 

Arrij = 0, ± 1 . 

1) '£>5/2 -"^3/2 transition: 

- 9 jCP 2/2) ■ rrij^Pin) 
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13.6 





my= my = 


II 


= m,= 


'”y = 


^^3/2 


5/2 3/2 


1/2 -1/2 -3/2 


-5/2 


rrij = 3/2 


1 -1/5 


-7/5 






my = 1/2 


17/lf 


i -1/15 -19/15 




my= -1/2 




19/15 


1/15 -17/15 




my= -3/2 






7/5 1/5 


-1 


2) ^T>3/2 ^ 


^P 3 / 2 -transition: 






A(hv) _ 








-gj&yi) ■ 










mj = 


^j = 


mj = 






3/2 


1/2 


-1/2 


-3/2 


mj = 3/2 


-4/5 


-8/5 






my = 1/2 


8/15 


-4/15 


-16/15 




my= -1/2 




16/15 


4/15 


-8/15 


my= -3/2 






8/5 


4/5 


3) ^£>3/2 


^Pi/ 2 - transition: 






A(hv) _ 
I^bB 


OjC’D^n) • tTi 








-gj&m) • 










mj = 


mj = 


rrij = my = 


^PU2 


3/2 


1/2 


-1/2 


-3/2 


my = 1/2 


13/15 


1/15 


-11/15 




mj= -1/2 




11/15 


-1/15 


-13/15 



a) No magnetic field: Each line in the Balmer 
series, neglecting the Lamb shift, consists of five 
individual lines. 



b) Weak magnetic field: Anomalous Zeeman ef- 
fect. 



Transition 


F - S Energy 
A(hv)Y-s 
Ea^ 


Number of 

Zeeman 

lines 


Total Zeeman 
splitting in 
fi^B 


n ^Dy 2 2^^372 


0.01254 


12 


2.8 


^ ^^3/2 ^1^3/2 


0.006366 


10 


3.2 


^ ^^3/2 2 Pj /2 


0.06887 


6 


1.73 


^ ^^3/2 2 ^ 5 j /2 


0.06887 


6 


3.33 


^ ^^ 1/2 2% /2 


0.05035 


4 


2.67 


n ^ 5 j /2 -► 2^ P \/2 


0.05035 


4 


2.67 


^ ^<^ 1/2 2 ^ P 3/2 


-0.01215 


6 


3.33 



(cf. also Problem 13.5). 



c) Strong magnetic field: In the Paschen-Back ef- 
fect, three lines separated by are found in the 
spectrum. The central line corresponds to the 
transition energy without spin-orbit coupling. 



Magnetic moments [from (13.17)] 



State 




lCu;)y|[Am2] 


^^\/2 


Vi 


1.61 • 10““ 


^Pi/2 


1 

7^ 

2]/i 


5.35 • 10““ 


^PV2 


2.40-10““ 


'^3/2 


2l/| 


1.44- 10““ 


^^5/2 


3/i 


3.29-10“^ 



Estimate of the transition from the Zeeman to the 
Paschen-Back region: 

ZIE'f-s^/jc-O.I cm"‘ 

thus, 5iin, » 0.2 T. 

B = 0 B = 0 B weak B strong 

with spin- Zeeman- Paschen- 

orbit coupling splitting Back effect 




13.7 a) According to Problem 13.6, the transition from 
the Zeeman to the Paschen-Back region occurs 
around B = 0.2 T, i.e. here we are in the region of 
the Paschen-Back effect. 

Note that is the deviation of the energy 

from that of the same state without fine-structure 
splitting. 
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Selection rules: Ami=0\ ±\ \ Am^ = Q, 

without coupling coupling 

B-field only with I with Sand I 



0 i 
-1 i 

1 -i 
0 

-i 



0 \ 



0 

In the spectrum, one observes three lines with a 
splitting of J V = 2.098 cm"^ 

c) Let be the energy splitting between the lev- 
els in the case of the Paschen-Back effect: 

eh 

AE = fi-gBQ = — Bq = hv 

2 mo 

then ( = 1 .76 • 10" C/kg 
\«io/ Bo 

(Literature value: e/mo = 1.7588 • lO" C/kg). 

d) Since the energy-level splitting due to the 
Paschen-Back effect is independent of n, the 
splitting of the first line in the Lyman series is the 
same in wavenumbers as that of the H„ line. Thus 

Zl A (Lyman) < A(H„) . 




Result: 

E„ = h(OLin + {) 

lei I5| 



OiL 



2 mo 



14.2 By substituting in the corresponding matrices, the 
given relations can be immediately verified. 

14.3 ]/c^p^-f-/Woc'‘ = «ioC^ j/l +p^/mlc^ 

« moC^(l -l-ypVmoC^) 

(Power series expansion of the square root termi- 
nated after the second term.) 



14.4 0^-{mo c/hf') [j/ = 0 (Klein-Gordon Equation). 
From this: 

If/* (D^- (awoc/^)^) y/ = Q . 

The complex conjugate expression is 
(^(□^-(ffJoc/^)^) y/* = Q . 

Taking the difference: 

\j/*(0‘--{moc/hf) y/- y/i^-{moc/hf‘) y/* - 0, 
and 

y/*[3^y/— y/O^y/* = 0 . 

With 

□^ = -L ) we have 



y/* \/^y/ — y/\7^ y/* 







aV 




= 0 . 



13.8 a), b): see Chap. 19 
c): «>5. 

14.1 The Hamiltonian is 



From this we obtain 



at 



%yi* 

y / — — — — y/* 

at 



Qy/ 



3/!/’= — (p + e^ 

2mo\ J 

2/Mo \ dy^ 



-l-div(i//* V y/- y/ V y/*) = 0 . 

Multiplying the preceding expression by h/2\mo, 
we obtain 

at \^2moc^ \ at at / 



+ 2eBx- — + e^B^x^ 
i ay 



-l-div 



2imo 



{y/* V y/— I// V y/*) 



= 0 , 
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or 



— ^ + div/ = 0 . 
dt 

14.5 (ih)^ — :^y/ = ih — Jfy/-Jif’ih—y/=Jf'^y/, 

^ jZ ^ ' 



ar 



dt 



dt 



dt 



{-h^c^V^ + mlc'^)y/ or 

^ \ mW 



14.6 Substitution into the Dirac equation yields 



-E+rrioC^ 

0 

chk 

0 

>r 

'Pi 



0 

-E+nioC^ 

0 

-chk 



chk 

0 

-E-nio c^ 
0 



0 

-chk 

0 

-E-moC^ 



Pi 



= 0 . 



Using the trial solutions ( IPj ,0, 0) and (0, 

0, P^), one can decompose this system of equa- 
tions. The corresponding energies are 



E = ±\/niQC^+c^h^k^ . 



15.1 Electric field strength = F 
n = \ level: 

y/= (pi-F4>o+\/^P<Pi 



2 772 



= 0 , E^^^ = ^ ^ 



2 • 



2mo(o 

n = 2 level: 

02 - 1 ^^ 01 + 1/3 F 03 



E^'^ = 0,E 



( 2 ). 



2 j~'2 

eF 

2mo(jj" 



,F= -eF/\/2mo(o^h 



15.2 Eigenfunctions and eigenvalues: 



OnW = 



1 



Jn<p 



1/2^ 



n = 0, ±1, ±2, ±3... 



El = 



h^n^ 



" Mod ' 



First-order perturbation theory: 
Ej, = 0 for n ± 1 



for n = ± 1 : E+ = — a, 0+ 



1 

y 



El = - — a, 6>_ 



1 

^TT 

i 



(e‘^ + e“‘«*) 
(e'^-e-*^) 



15.3 J^,id ^=lh(oMh + \)^ = E4> (1) 

A 

with obtain 

E h(D,{bt (fo) = Ebt 4>o. 

A 

Making use of b^b^ - bt'h = dw, it follows that 



hoj),' by 00+ Hbcoxbx by b^ 0 q 

A 

+ Y.hoj,{\)bt>0o = Ebt^o- 

A 

Since b^ 0q = have 

hcoyby 00+ X ba>x{Y)by 0o = Eby 00- 

A 

Then 0y obeys expression (1) under the condition 

that E = ho)y + Y ^ hcox- 
A 



15.4 N ihoj,b^ b,{bt r(b^ n . ,{b^r ^ 00 

A 

+ (N/2) E hco,ibtnb^)%.,(b^r^0o 

A 

= EN(bt r {bi T\ . . {b;,r^ 00 , (1) 

with N = l/]/^Zi! ni \ . . .n^l . 

Taking into account the commutation relations 
given, we obtain: 

Nh cui bi rii (bi )”' " ‘ (b2 (b^ )"^ 0q 

+ Nh 0 J 2 bt ri 2 (bt )"‘ ( 62 ^ ‘ • ip ^ )"~ «>o 

+ ...Nho)Nbt, nybt )"' {b2 

...ibt,)’'^-^ ^>o+N I hoi^bt (bt )"• (b^ )"2 

A 

. . . {by^'^b^ ^ o+(N/ 2) I hoj^ibt )"' 
x{by"\..(b^r'^<Po 
= EN{btT'{bir\..{b;,r'^4>o. 
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With h<l>Q = 0 and bUb^f-{Krbt = 0, it 
follows that 

Nlhcoxnx {bt )"‘ {bi f\..{bt,)''’^ o 
A 

+ {N/2) I h(0), {bt )"> (bi )"2 . . . {b ;^ )"^ ^0 

A 

= NE(bt )"' {b^ )"^. . . (btjfN 00 • 

Then 0 obeys relation (1) with the condition 
E= I^co;i«A + (i) 

A A 

15.5 \\i/*{r)p^ii/„{r)dr 

= I 1 Vti>’)pr\i/„'{r)dr\ y/*.{r')pr'i//n(r')dr' 
n 

= \[wtir)PA\Pr Vnir')\ I Vn’ir)wt’ir')]dr'dr 

n' 

= 1 W*n{r)pA\ \Pr Wnir')]d{r-r')dr'}]dr 
= 1 ¥t(>')PrPrWn(r)dr 

= l¥*(r)P^¥„ir)dr 



£(2) = 1_L_ 

3 8;t£o 



¥n(r)s-s 




¥n(r) 



= -J—<s-sy I ii/„(rfd{r)d^r 
6 Co 



6eo 



n 


2 


3 


4 


-E/hcR 


17/4 


37/9 


65/16 


£[eV] 


-57.76 


-55.87 


-55.21 


ZexpleV] 


-58.31 


-56.01 


-55.25 



With increasing n, the regions of high probability 
density of the two electrons move apart, so that 
the inner electron behaves like an electron in a 
He^ atom (Z = 2) and the outer electron behaves 
like that of a hydrogen atom (Z = 1). Then the 
theoretical and experimental values agree more 
closely. 



15.6 Rearrangements: ^ 

n' 

= E {n\p\n"y{n'\p\n)(El,-E^ 

= y E [{n\p\3e^in')(jn'\p\n) 

n' 

+ {n\p\n'){Jt^^n'\p\n)-{3ft^xn\p\n') 
x{n'\p\n)-{n\p\n'){n'\p\-^i\n)] 

= tI !«'><«' |p|«> 

n' 

-{n\p\n'){n'\[p,.^A]\n>] 



15.7 



Taylor expansion: 

<¥n(r) ¥nir)) = <¥nir) V(r) 



^ i,j 



e 

Atieo 



+ ... . 



¥nir)SjSj 



8 8 1 
8x,- QXj \r 



¥n(r)^ 



Furthermore, we have 



^SfSjy bj^j 2 ’S') , 

We thus have for the second-order term: 



17.2 7V=(2Z, + 1) (2S-t-l) is the number of different 
quantum-mechanical states of an atom with orbi- 
tal angular momentum L and total spin S. In Z - S 
coupling with S ^L, we have 

Y,{2J+i)= i (2{L + k) + i) = (2S+l)i2L + i) 

k= -S 

= N 

J=L-S...L + S . 

That is, the number of states is conserved in Z - S 
coupling. (The case L<S can be obtained by ex- 
changing Z and S.) 

17.3 a) Z - S coupling: 

/i = l, h = 2 Z = l,2, 3 

5j = 1/2, S2 =1/2 S = 0, 1 

Possible states: 



\l 


1 


2 


3 


5 \ 








0 




'Z)2 


’^=■3 


1 






^2 3^3 3^4 
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All 



Each term includes (2/+ 1) states; thus, there are 
altogether 60 states. 



b)y-y coupling: 



/j = 1,5j = 1/2 71 = 1/2,3/2 

/2 = 2, 52 =1/2 72 = 3/2,5/2 



Possible states: 

( 3 / 2 )-^= 1;2 (^^h^/=0;l;2;3 

(v 2)^/=2;3 (i;^)^7=l;2;3;4 



The same abundances of the different / values are 
found, and thus the same total number of states. 



17.4 a) Electron configuration of the excited C atom: 

C:l5^ 25^ Ip 3rf l2 = l, 52=1/2 

' ' ' /i = l, 5 i = 1/2 

I— j _ ^ A 

^core *^core 

Possible / and S values: L = l,2, 3;S = 0, 1 
Possible / values: /= 0, 1, 2, 3, 4 

Possible terms: ^^3 singlet S = 0 

^Po % triplet S = 1 

1 2 3 

2 3 4 

b) Occupation of the 3d subshell due to Hund’s 
rules (cf. Sect. 19.2): 



3/ 


t 


t 


t 






mi = 


-2 


-1 


0 


1 


2 



_L = 3; 5 = 3/2; J=3/2^%/2 
Magnetic moment [cf. (13.17)]: 



c) Occupation according to Hund’s rule (cf. Sect. 
19.2) gives for the ground state in 

Y:L = 2, 5=1/2, J=3/l-^'^Dy2 
Zr:L = 3, 5=1, /=2 , 



d) Electron configuration of Mn: 
Mn: \s'^ls^lp^3s^3p^ As'^3d^ 



3<f 


t 


T 


T 


T 


T 


mi = 


-2 


-1 


0 


1 


2 



Ground state: L = 0, 5 = 5/2, / = 5/2 ^ ^ 55/2 . 



17.5 a) General procedure: 

1) Determination of L and 5 from the term sym- 
bol; 

2) Determination of J from the number m of par- 
ticle beams in the Stern-Gerlach experiment: 
m = 2 /-h 1 ; 

3) Maximum magnetic moment in the direction of 
the magnetic field: 



/^z,max ~ 



0 y=l + 



/(7+l) + S(S+l)-L(£ + l) 
2/(7+ 1) 



Atom 


Term symbol 


L 


S 


J 




^^Z, max 
/^B 


Vanadium 




3 


3 

2 


3 

2 


2 

5 


3 

5 


Manganese 




0 


5 

2 


5 

2 


2 


5 


Iron 




2 


2 


4 


3 

2 


6 



b) For S = 0, the energy level splitting in a mag- 
netic field is given by: 

AE = E{mi= L)-E{mi= -L) = 2Lp^B; 



3, giving 'Fj . 





thus. 




^ AE hcv 

2p^B 2p^B 


18.1 


A™„ = 3.09-10-“ 


18.2 


AEco = hc/k = \. 



(Zco-ir 



)“‘^J = 6952 eV 

In comparison to the hydrogen atom, we have 
AE^ = / = 10.21 eV » 

IZ JL 

(cf. Moseley’s rule. Sect. 18.4). 

18.3 AA:^(Cu) = 1.55-10“’“m = 1.55A 

18.4 Manganese (Z = 25) 

18.5 Linear absorption coefficient p\ 

/(x) = /oe“"^; p = . 



X 
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Iq and I(x) are the intensities of the incident beam 
and of the beam after the distance x. From the ex- 
perimental data it follows that 

/i = (131.7±0.3)m“^ 

18.6 / = /oe~^^^x = — 



a)/zv[MeV] 


0.05 


0.3 


1 


H [1/m] 


8000 


500 


78 


X [m] 


2.88 


4.61 • 10“^ 


2.95 • 10-^ 


b)/iv[MeV] 


0.05 


0.3 


1 


///o 


00 

O 

yr\ 


8.21 • 10“^ 


0.677 



18.7 The half-absorption length of a material: 

In 2 



/ = 7o e = y / q; thus, c?i /2 = 



yM 



When Iq/I(x) is known, it follows for the thick- 
ness X that 



x/du2 = 



ln(/o/7) 
In 2 



/q// 


16 


20 


200 


X/dx/2 


4.00 


4.32 


7.64 



18.8 a) Photon energies of the K series: 

1 
n 



e„ = hv„ = hcR (Z- \f ( 4—^ ) • 



Energy levels: 



The velocity of the electron is calculated from the 
relativistic relation to be 




19.1 a) 15 electrons: phosphorous 
b) 46 electrons: palladium 

19.2 Consider an atom with several valence electrons 
having the same n, I quantum numbers, and ne- 
glect the spin-orbit interaction and interactions of 
the electrons among themselves. Then the total 
state of the valence electrons can be represented as 
a product of one-electron states. A one-electron 
state is characterized by the quantum numbers n, 
I, nil, ^nd ms. 

¥'tot = (1) 

The operators Lz and §z for the Z components of 
the total orbital angular momentum and the total 
spin are 

^z= + /z 2 +--- ; ^z = Sz^ + Sz^+ ... ( 2 ) 

If a shell n, I is fully occupied, all of the possible 
one-electron states (m/, m^) are filled just once, 
according to the Pauli principle. From (1), the fol- 
lowing wavefunction is obtained: 



'Ptoi= + 1/2 ( 1 ) • 1 / 2 ( 2 ) •... 

■ !7^-l, + i/2(47/ + 1) • _i/2(4L-I-2) . 

!Ptot is an eigenfunction of £2 and of Sz. The ei- 
genvalues are rriLh = 0 and rrish = 0. 

Therefore, the total orbital angular momentum 
and the total spin must vanish, i.e. 7. = S = 0. 



Shell 


K 


L 


M 


N 


Energy [eV] 


-69660 


- 10620 


-2280 


-390 



b) In order to excite the L series, an electron must 
be removed from the L shell, i.e. one requires at 
least the energy 

-Ei= - 10.62 keV. 

The wavelength of the L„ line is = 1 .49 A. 

18.9 E’kin = Ex- IE 2 = 5.57 • 10~ J = 34.8 keV. 



Note: the function ¥^tot in (1) does not yet fulfill 
the Pauli principle, since it is not antisymmetric 
with respect to exchange of two electrons. Strictly 
speaking, one would have to set up a determinant 
as in (19.18); this has no effect on the conclusions 
drawn above. 

19.3 Pauli Principle: 

The state of an electron in an atom can be speci- 
fied by four quantum numbers. If the spin-orbit 
coupling and the interaction with other electrons 
which may be present are neglected, the quantum 
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numbers n, I, mi, and may be used. The Pauli 
principle then requires that in a system containing 
several electrons, no two electrons may have 
exactly the same four quantum number values. 
This principle is based on the indistinguishability 
of electrons. 

If one uses the quantum numbers n, I, mi, and m^, 
then, taking the Pauli principle into account, the 
following two-electron states are found ( x) for an 
np^ configuration: 



Table 1 




(0, + ) 
( 0 ,-) 
(- 1 , + ) 
(- 1 ,-) 



X 

X 



In the following, we denote such a two-particle 
state by the expression m^^)(rrii^, m^^] with 
m/= +1, 0, -1 and m^ = ±(1/2). 

The coupling of two angular momenta: 

When two arbitrary angular momenta /j and J 2 
are coupled to a total angular momentum 
J = J\ + J 2 , the quantum number J can take on the 
values J= J^ + J 2 , . . . , |/i-/ 2 l* For each / value- 
there are the m j values mj= - J, . . . , +J. 

The corresponding total angular momentum state 
(J, m j) can be represented as a linear combination 
of the product states (/j, mj^) • (J 2 , mj^, where 
mj= mj^ + mj^is always fulfilled. 

{Jy ^j) — 1] ^Jl) 

... 

= ( 1 ) 



After this coupling, mj^ and mj^ are no longer 
valid quantum numbers. An angular momentum 
state is specified uniquely by the quantum num- 
bers J\, J 2 y Jy and m j. In both j - j and in L - S 
coupling, one arrives at the total angular momen- 
tum J of the system by means of such a two-stage 
angular momentum coupling, starting from the 
non-interacting particle model. 

a)y-y Coupling: 

In the case of j -j coupling, in the first step, the 
orbital angular momentum / and the spin 5 of each 



particle (electron) are coupled to a single-particle 
angular momentum y. Table 2 shows the ordering 
of all the states which are consistent with the Pauli 
principle from Table 1, according to the new 
single-particle quantum number mj. 




[( 1 , + )(-!, + )] [( 0 , + )( 0 ,-)] 

[( 1 , -)(-!, + )] 

[( 0 , +)(-!,+)] 

-3/2 [(l, + )(ri,_)] [(1, _)(_!,_)] [(0, _)(-!,_)] 
[( 0 , + )(-!,-)] [(- 1 , +)(-!,-)] 



mj= -2 



In the second step, the single-particle angular mo- 
menta are coupled to give the total angular mo- 
mentum J(j-j coupling). From (1), only those 
states contribute to a state with quantum number 
mj (belonging to J) for which mj^ + mj^ = mj. The 
number of such states is conserved, so the abun- 
dance of the various possible m j values may be 
calculated. 



mJ 


2 


1 


0 


-1 


-2 


Abundance 


2 


3 


5 


3 


2 



Here, the following quantum numbers J occur: 



J 


2 


1 


0 


Abundance 


2 


1 


2 



b) L-S Coupling: 

In the first step, the orbital angular momenta l\ 
and I 2 couple to give the total orbital angular mo- 
mentum L and the spins and S 2 couple to the to- 
tal spin S, Table 3 gives the ordering of the states 
which are consistent with the Pauli principle from 
Table 1, according to the new quantum numbers 
mg and 
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For the L - S coupling to a total angular momen- 
tum J we find, similarly to part (a): 





2 


1 


0 


-1 


-2 


Abundance 


2 


3 


5 


3 


2 



and from this: 



J 


2 


1 


0 


Abundance 


2 


1 


2 



Thus, the same / values occur with the same abun- 
dances. 



19.4 The given electron configurations are considered 
in the light of L - S coupling. If several electrons 
have the same n and / values, the Pauli principle 
must be observed (cf. Problem 19.3). 

a) Electron configuration ns\ 

Possible term: 2 states 

b) Electron configuration np^\ 

One finds the following abundances for the vari- 
ous (nil, pairs: 





3/2 


1/2 


-1/2 


-3/2 


2 


0 


1 


1 


0 


1 


0 


2 


2 


0 


0 


1 


3 


3 


1 


-1 


0 


2 


2 


0 


-2 


0 


1 


1 


0 



leading to the possible terms: 



L 


S 


Terms 


Number of states 


2 


1/2 


^Z>3/2 ^T>5/2 


10 


1 


1/2 


'A/2 '^3/2 


6 


0 


3/2 


"^-^3/2 


4 



c) Electron configuration np^n's: 
Possible terms: 


L s 


Terms 


Number of states 


2 1/2 


^^3/2 ^^5/2 


10 


1 3/2 


"^1/2 "^3/2 "^5/2 


12 


1 1/2 


"PU2 ^P3/2 


6 


0 1/2 


^S'l/2 


2 



d) Electron configuration np^\ 

Possible terms: L = \\ S= 1/2: ^P \/2 6 sta- 

tes 

e) Electron configuration nd^ n'p: 

Possible terms: 



L 


5 


Terms 


Number of states 


5 


1/2 


"Hu /2 


22 


4 


1/2 


2 ^ (^^ 7/2 ^^9/2) 


36 




3/2 


^^5/2 ^^ 7/2 ^^ 9/2 "^^ 11/2 


36 


3 


1/2 


ixeF5/2"Fi/i) 


42 




3/2 


‘'F3/2 ''F5/2 *Fj/2 ‘'F9/2 


28 


2 


1/2 


3xeD2/2^D,,2) 


30 




3/2 


2x (^Du 2 *Dy2 ‘'O5/2 ‘'07,2) 


40 


1 


1/2 


3x ePu2 "P 3 / 2 ) 


18 




3/2 


^Pu2 ^Pi/2 %/2 


12 


0 


1/2 


^■ 51/2 


2 




3/2 


"'■53/2 


4 
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f) Electron configuration nd n'd: 
Possible terms: 



According to Hund’s rules, for the ground state 
term we haveL = 3, 1, /= 2; i.e. ^^ 2 . 



L 


S 


Terms 


Number of states 


4 


0 


'G, 


9 




1 


^G4 ^Gj 


27 


3 


0 


'^3 


7 




1 


^2 ^3 ^4 


21 


2 


0 


'O 2 


5 




1 




15 


1 


0 


'Px 


3 




1 


5/>0 5P2 


9 


0 


0 


’5o 


1 




1 




3 



Element 


Electron configuration 


Ground state 






L 


s 


J Term 


Silicon 


l5^ 25^ 


1 


1 


0 3Po 


Chlorine 


2i^ 2p* 3s^ 3p^ 


1 


1 

2 


3 2p 
T ^3/2 


Arsenic 


l5^ 2s^ 2p^35^ 
3p*3rf'®45^ 4p^ 


0 


3 

2 


3 4o 
T *^3/2 



19.6 



Element 


Electron configuration 


Number of 
unpaired 


Ground 

state 


Sulfur 


2s^ 2p*35^ 3p'‘ 


2 


"Pi 


Calcium 


2s^ 2p*3s^ 3p* 4s^ 


0 


'S, 


Iron 


25^ 2p^5^ 
3p*3rf* 45^ 


4 


'A 


Bromine 


l5^ 2s^2p‘3s^ 3p* 
3rf“’ 4s^ 4p5 


1 


2 p 

^3/2 



19.7 Electron configuration: nd^ 

Possible terms (cf. Problems 19.3 and 19.4): 



L 


S 


Terms 


Number of states 


4 


0 


‘G4 


9 


3 


1 


^Fl ^P3 % 


21 


2 


0 


'^2 


5 


1 


1 


3Po ^^2 


9 


0 


0 


'So 


1 



19.8 Proof of relation (19.25): 

0t(2) = (<7^,1 + 0 -^, 2 ) 0t(l) 0t(2) 

= (|^ + |^)0t(l)0t(2). 

The proofs of relations (19.26) and (19.32) are 
analogous. To prove relations (19.35 - 38), we re- 
quire the following identities: 

(7i + 2(Ti 

<Ti • (72 = ^x, 1 

<^x,2+ + ^-,2+ ^-,1 ^ + , 2 ) • 

With these, it follows that 

27^ ~ (TiH-(724"2 \ (7^^ 2"^^ + ,1^-,2"^^-,1^ + ,2 
and the following relations are valid: 

(T?0t(l) = \h(t>y{\), a\<Pi{\) = ih0i{\y, 

= 0, <i+,i0i(i) = h(t)]{\y, 

<T_,i0i(l) = 0 , <T_,i0t(l) = 

(similar relations hold for the second electron.) 
With these, one can prove (19.35 - 38)! 

19.9 When the quantum numbers qx and ^2 ^re identi- 
cal, no triplet state exists. 

[See (19.19, 20 and 30)] 



20.1 The formula for hyperfine structure (20.10) is 

AE^ = — [F(F+ !)-/(/+ 1) - /(/+ 1)] 

2 

The interval rule (20.12) states that 

A Ey +1 — A Ey — Cl {F 4“ 1 ) . 

From this, F^ax and a may be determined to be 

^max = 7 a = (0.0789 ±0.0006) cm 
and F=2, . . . , 7 so that I = 9/2. 



F 


2 


3 


4 5 6 


7 


AE^/a 


-13.75 


-10.75 


-6.75 -1.75 4.25 


11.25 
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Solutions to the Problems 



20.2 The hyperfine structure formula (20.10) is 

AE^ = — [F(F+\)-I{I+\)-J{J+\)]\ 

2 

^ Bj 

K^(/+l) 

In the hydrogen atom, (20.11) is valid: 

2 2 2 1 
« = — y«0 9if^N I W(0) I ; I y/(0) I = — 3 -^ . 

(Here, r^i is the first Bohr radius.) Then 

Bj = tMo 9e I ¥(0) P !//(/+!). 

According to the interval rule, the energy differ- 
ence between parallel (F=l) and antiparallel 
(F = 0) orientations of the electronic and the pro- 
ton spins is just a. 



n 


|v/(0)p[l/m’] 


Bj[T] 


a [cm ^ ] 


1 


2.15 • 10^® 


29.0 


0.0475 


2 


2.69 • 10^^ 


3.63 


0.00594 


3 


7.96 • 10^* 


1.07 


0.00176 



20.3 fl//!C = (5.85-^0.52) • 10“^cm“‘ 

By = (3.03 ± 0.27) T 

20.4 Zeeman energy of the electron: 

Ee- =gefiBBonts = 5.5T ■ W~^5 ■ 

= 3.48 • 10"^ eV -m.^hc- 0.281 cm“‘ • m,. 

Hyperfine interaction energy with 
a = 0.0475 cm“*: 

Fhfs = artijiris, = he • 0.0475 cm“^ • mitn^ 

= 5.89-10"^eV- W/Ws 
= 9.44-10“^5j-m/ms. 

Zeeman energy of the nucleus (^/ = 5.585): 

Fp= -8.46 • 10"^^ J • mj 

= -5.28-10'*eV-m^=/ic-4.26-10“'‘ 
cm“* • nil. 

20.5 y = 6.624 • lO'' mVVs^ 

0/=1.38 

h-z.max. _ 3 45 

Bn 



20.6 According to (20.15), in the case of a “strong” ap- 
plied magnetic field we have 

^^HFS = 07 Bb rnjBi + amjnij. 

In the Stern-Gerlach experiment, the beam splits 
into (2/-I-1) (27-1-1) sub-beams; thus, we have 
/(^^Na) = 3/2. 

In the case of a “weak” magnetic field, the beam 
is also split into 8 sub-beams, which, however, 
have a different geometrical arrangement. 

21.1 IF= 1.224- 10“'^s“‘ 

21.2 





1 cm 


10 cm 


R = 99% 


% = 3.336 - 10“’ s 


= 3.336- 10“ S 


R = 90% 


= 3.336- 10“ ‘®s 


ro = 3.336- 10“’ s 


R = 10% 


/o = 3.706- 10“ " s 


?o = 3.706- 10“ '®s 




100 cm 




R = 99% 


/o = 3.336- 10“’ s 




R = 90% 


?o = 3.336 - 10“*s 




R = 10% 


?o = 3.706- 10“’ s 





21.3 B = 90%,L = 100cm: 

The number of photons decreases to 1/e of its ini- 
tial value after a time T = 3.336 • 10"^ s. 

21.4 The laser condition is 

N 2 -N 1 Snv^Avz 

— ^ • 

V c^to 

The formula 



gives the lifetime of the photons in the resonator. 

For the length of the resonator, we assume e.g. 
10 cm, and thus obtain the value 1.2 • lO^Vcm^ for 
the critical inversion density. 

22.1 Inserting (**) into (*) and taking into account 
the rules given yields equations of the form 
(14.87) and (14.88), which can then be treated 
further as in Chap. 14. 
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23.1 One requires only the orthogonality of the wave- 
functions. 

23.2 (^(1,2) 

= i)l + ^ 2 | T)l)(^ll i) 2 +^ 2 l 1 ) 2 ) 

+ (^2U)l-<lT)l)(^2|i)2-^l|T)2)} 

= ^{|rf.PlU) + l^2hTT) + |^2hU) 

+ l(^iPlTT)} 

= + I TT)}- 

23.3 We make use of the spin formalism and com- 
pute: 

U(i \b ~ (i Ia(T 
(I T \a)\ Db ~ I i Ia)I T)^) 

= ^((TUITU)U Ul iW 

- ((T i L)(i lsi>;| T),) 

“ ((i Ia^«I T Ia)(T Ib^;| 1)5) 

+ ((i Ia^iI i Ia)(T T)fi)- 

Owing to the similar structure of the A and B ex- 
pressions, it is sufficient to calculate those for A 
only. We obtain 

(T Ia^/I T Ia) = 

= a/x(T Wx\ T) + a/j(T ky| T) + a/z(T Wz\ T) 

~ ^iz 

due to the properties of the spin matrices. 
Similarly: 

(i {aH Da = -aiz 

(T i)^ = «« + io-iy 

(i U«/l T)a = - *%• 

Inserting these results and the corresponding 
ones for the matrix elements of bj yields (23.25) 
directly. 

23.4 1) = I t ),^2 = 1 i) are orthonormalised. 

2) All vectors v in the Hilbert space can be ex- 



pressed as V = (Z€i -f Pc 2 , where |a|^ + \pf' = 1. 
Due to 1), these vectors are also normalised. 

23.5 The photon plays the role of controller. 

24.1 Transform the coordinates according to 



Xb= -Xa, X„>0. 




V/(x) = Ae'^^ 


X< -Xa 


H^) = 5e'^^+Ce-'^^ 


-Xa<X< -Xa 


\!/{x) = £)e“'^^ 


X>Xa 


K = ]/2mb\E\/h^ 




Symmetric state: y/ix) 


= Wi-x). 



K-ntaP/h^ = Q 

B = C,A=D 

Antisymmetric state: [j/{x) = - 
K-moP/h^ = -e^^'^^‘'moP/h^ 

B= -C,A = -D 

24.2 Transform the coordinates, so that x*= -x„, 

Xq > 0 . 

\!/{x)=Aq'^^ X<-Xa 

i^(x) = 5e'^^-l-Ce“'^^ -x<,<x<-Xa 

\!/(x) = X>Xa 

K = \/^o\E\/h^ 

(k- moPa/h^){K- moPb / = t~'^’^"‘‘-p„Pbml/h* 
{C/Bf = Pa(K- moPb/h^)/(Pb(K- moPa/tP')) 

24.3 Wavefunction: 

for na<x< (n-l-l)a 

A;t = 5Ae‘*‘'-e-'^")(e'^"-e‘*“) 

K = ]/2mo\E\/!t^ 

. _2n 2 t^ ^ 2n 

6 6 6 

Auxiliary equation for k: 

, , sinh/cflr „..2 

cos ka = cosh kg mop/h . 

K 
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Fundamental Constants of Atomic Physics 
in the International System of Units (SI) 


Permeability Constant 




= 4jr-10“’VsA*'ra“‘ 


of Vacuum 




= 1. 256637... 


Permittivity Constant 




= (A<oC^)“' 


of Vacuum 




= 8.8541878 ...- 10" As V-‘ m“ ' 


Velocity of Light in Vacuum 


c 


= 2.99792458- 10® ms 


Boltzmann’s Constant 


k 


= 1.380658- 10“^® JK“' 


Faraday Constant 


F 


= 9.6485309- 10'‘Cmor‘ 


Elementary Charge 


e 


= 1.6021773- 10" '*C 


Rest Mass of the Electron 


rtiQ 


= 9. 1093897- 10" kg 


Specific Charge of the 


e/niQ 


= 1.75881%2-10"Ckg ‘ 


Electron 


Rest Mass of the Proton 


mp 


= 1.6726231 -10“^^ kg 


Planck’s Constant 


h 


= 6.626755- 10" ®'‘Js 




h 


= h/2n = 1.0545887- lO"®'* Js 


Rydberg Constant 




= 1.0973731534- 10® m“' 


Bohr Radius 




= 0.529177249- 10“ ‘“m 


Bohr Magneton 




= 9.2740154-10“" Am® (= J/T) 


Nuclear Magneton 




= 5.0507866- 10“®® Am® 


Compton Wavelength of 


Ac 


= 2.42631058- 10" '®m 


the Electron 


Fine Structure Constant 


a 


= 7.29735308-10“® 


Avogadro’s number 




= 6.022045- 10®® mol“' 


Energy Conversion Table see inside back cover 



Energy Conversion Table 




J 


eV 


cm ^ 


K 


1 Joule (J) = 
1 eVolt (eV) = 
1 cm”' = 

1 K 


1 

1.60219- 10 

1.98648-10“^^ 

1.38066-10“^* 


6.24146 ■ 10'® 

1 

1.23985 ■ 10“'* 
8.61735 • 10“* 


5.03404 • 10“ 
8.06548 • 10* 

1 

6.95030-10“' 


7.24290 • 10“ 
1.16045 ■ 10“ 
1.43879 
1 



Explanation 

The energy E is quoted in Joule (J) or watt-seconds (Ws) 

IJ = 1 Ws , 

In spectroscopy, one frequently quotes the term values in wavenumbers 
v^E/hc, 

The conversion factor is 

E/v = he = 1.98648 - 10“® J/cm“' . 

Another energy unit, especially in collision experiments, is the electron volt 
(eVolt, eV). The voltage V is given in volts, and the energy conversion factor is 
obtained from E = eV: 

E/V= e = 1.60219 • lO"*’ J/V . 

In the case of thermal excitation with the heat energy kT^ the absolute tem- 
perature is a measure of the energy. From E — kT^^ obtain the conversion 
factor 

E/T= k = 1 .38066 • 10“^ J/K . 



